bayes_snapper.npy outdated and undocumented #7

geissdoerfer · 2024-02-19T23:54:10Z

The repository contains a bayes_snapper.npy that apparently contains a bayesian model to rate satellite quality. Unfortunately the model seems to have been generated with an outdated scikit version that is not compatible anymore with recent Python. Would you be able to provide the data and method to generate the model? This would benefit reproducibility. Thanks!

The text was updated successfully, but these errors were encountered:

JonasBchrt · 2024-02-20T09:44:55Z

I will have a look - give me a few days.

Some preliminary notes:

I have a dataset with SnapperGPS data from different scenarios with all satellites labelled whether they are visible or not (based on an elevation threshold) and whether their pseudorange was useful to estimate a position or is an outlier. In addition, it contains the estimated SNRs of all satellites.
Then - for each GNSS separately - I fit a Bayes classifier that maps SNRs to a binary label good satellite/bad satellite.
I use GaussianNB from sklearn.naive_bayes for fitting.
This assumes that SNRs are Gaussian distributed given the label, which is obviously an approximation.

This is from my thesis:

At first, it derives a prior probability P (vi = 1|SNRi) for each satellite observation
i ∈ 1, . . . N to be reliable, i.e., to be a so-called inlier, given the associated SNR.
The distribution p (SNRi) of the SNRs is modelled as a Gaussian mixture model
with two components, p (SNRi|vi = 1) for the inliers and p (SNRi|vi = 0) for the
outliers. Mean, standard deviation, and prior of each component are fitted to a
training dataset. This is done separately for each GNSS since the GPS L1 signal,
the Galileo E1 signal, and the BeiDou B1C signal have different properties and,
therefore, differently distributed SNRs. Using the resulting probabilistic models and
Bayes’ rule, the priors P (vi = 1|SNRi) = p(SNRi|vi=1)P(vi=1)/p(SNRi) for each satellite to be
an inlier and P (vi = 0|SNRi) = 1 − P (vi = 1|SNRi) to be an outlier are obtained.

Footnote:

Technically, SNRs are strictly positive while a Gaussian distribution’s support includes all
non-positive numbers, too. However, a Gaussian distribution is chosen because the probability
contained in the distribution’s tail that extends into the negative numbers is negligibly small
for the considered problem in practice. In addition, efficient algorithms for interference exist for
Gaussian distributions.

geissdoerfer · 2024-02-22T02:14:33Z

Thanks for the explanation, it makes sense. The table of labeled training data (csv?) and a script to train the model in the repository would be very helpful!

sunmouren · 2024-11-12T09:47:37Z

I will have a look - give me a few days.

Some preliminary notes:

I have a dataset with SnapperGPS data from different scenarios with all satellites labelled whether they are visible or not (based on an elevation threshold) and whether their pseudorange was useful to estimate a position or is an outlier. In addition, it contains the estimated SNRs of all satellites.

Then - for each GNSS separately - I fit a Bayes classifier that maps SNRs to a binary label good satellite/bad satellite.

I use GaussianNB from sklearn.naive_bayes for fitting.

This assumes that SNRs are Gaussian distributed given the label, which is obviously an approximation.

This is from my thesis:

At first, it derives a prior probability P (vi = 1|SNRi) for each satellite observation
i ∈ 1, . . . N to be reliable, i.e., to be a so-called inlier, given the associated SNR.
The distribution p (SNRi) of the SNRs is modelled as a Gaussian mixture model
with two components, p (SNRi|vi = 1) for the inliers and p (SNRi|vi = 0) for the
outliers. Mean, standard deviation, and prior of each component are fitted to a
training dataset. This is done separately for each GNSS since the GPS L1 signal,
the Galileo E1 signal, and the BeiDou B1C signal have different properties and,
therefore, differently distributed SNRs. Using the resulting probabilistic models and
Bayes’ rule, the priors P (vi = 1|SNRi) = p(SNRi|vi=1)P(vi=1)/p(SNRi) for each satellite to be
an inlier and P (vi = 0|SNRi) = 1 − P (vi = 1|SNRi) to be an outlier are obtained.

Footnote:

Technically, SNRs are strictly positive while a Gaussian distribution’s support includes all
non-positive numbers, too. However, a Gaussian distribution is chosen because the probability
contained in the distribution’s tail that extends into the negative numbers is negligibly small
for the considered problem in practice. In addition, efficient algorithms for interference exist for
Gaussian distributions.

There is a simple solution. You only need to modify the attributes of the old version model to the attributes of the new version model.

JonasBchrt added the documentation Improvements or additions to documentation label Feb 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bayes_snapper.npy outdated and undocumented #7

bayes_snapper.npy outdated and undocumented #7

geissdoerfer commented Feb 19, 2024

JonasBchrt commented Feb 20, 2024

geissdoerfer commented Feb 22, 2024

sunmouren commented Nov 12, 2024

bayes_snapper.npy outdated and undocumented #7

bayes_snapper.npy outdated and undocumented #7

Comments

geissdoerfer commented Feb 19, 2024

JonasBchrt commented Feb 20, 2024

geissdoerfer commented Feb 22, 2024

sunmouren commented Nov 12, 2024