Interactive Labeller - Does not match at probability 0.5 #134

Closed
sonalgoyal opened this issue Jan 18, 2022 · 3 comments
Labels
good first issue to start contributing to Zingg

Comments

@sonalgoyal
Member

While building a model on the 5M NC dataset, I saw some messages on the interactive learner. What does "do not match with probability 0.5" mean?
Screenshot from 2022-01-14 20-55-06

@sonalgoyal sonalgoyal added the good first issue to start contributing to Zingg label Jan 21, 2022
@navinrathore
Contributor

navinrathore commented Feb 15, 2022

Typical examples of the message in question, shown with the full score value:

Zingg predicts the above records DO NOT MATCH with a similarity score of 0.000000
Zingg predicts the above records DO NOT MATCH with a similarity score of 0.361295
Zingg predicts the above records DO NOT MATCH with a similarity score of 0.486515
Zingg predicts the above records DO NOT MATCH with a similarity score of 0.499993
Zingg predicts the above records MATCH with a similarity score of 1.000000
Zingg predicts the above records MATCH with a similarity score of 0.518476
Zingg predicts the above records MATCH with a similarity score of 0.999999

The code currently formats the "similarity score" to two decimal places, so the value is rounded. For example, 0.499993 is displayed as 0.50. Hence, on the console, the message says "... above records DO NOT MATCH with a similarity score of 0.50".
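To make the rounding effect concrete, here is a minimal Python sketch. It is not Zingg's actual code: the function names, the 0.5 decision threshold, and the message wording are assumptions chosen to reproduce the examples above. It compares two-decimal formatting of the raw score with a percentage formatted to two decimal places.

```python
# Illustrative sketch only; names and threshold are assumptions, not Zingg's code.
THRESHOLD = 0.5  # assumed decision boundary, inferred from the examples above


def label(score: float) -> str:
    """Return the predicted label for a similarity score."""
    return "MATCH" if score >= THRESHOLD else "DO NOT MATCH"


def as_score(score: float) -> str:
    """Format the raw score to two decimal places (the rounding described above)."""
    return "%.2f" % score


def as_percent(score: float) -> str:
    """Format the score as a percentage with two decimal places."""
    return "%.2f%%" % (score * 100)


def message(score: float, formatter) -> str:
    """Build a console-style message using the given formatter."""
    return "Zingg predicts the above records %s with a similarity score of %s" % (
        label(score),
        formatter(score),
    )


if __name__ == "__main__":
    for s in (0.361295, 0.486515, 0.499993, 0.518476):
        print(message(s, as_score))
        print(message(s, as_percent))
```

With these assumptions, 0.486515 is shown as 0.49 in the first format and 48.65% in the second, so the percentage carries two extra digits. A score as close to the threshold as 0.499993 still rounds to 50.00%, so telling it apart from 0.5 would need more digits or truncation instead of rounding.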

@sonalgoyal
Member Author

Please make it a % score to 2 decimal places then.


2 participants