-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use jaro_winkler similarity instead of rapidfuzz #491
Conversation
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #491 +/- ##
==========================================
- Coverage 83.12% 83.11% -0.02%
==========================================
Files 104 104
Lines 6412 6420 +8
==========================================
+ Hits 5330 5336 +6
- Misses 1082 1084 +2 ☔ View full report in Codecov by Sentry. |
""" | ||
|
||
# We can give a dictionary of {object: string used for the comparison} to the extract function | ||
# https://maxbachmann.github.io/RapidFuzz/Usage/process.html#extract | ||
|
||
# TODO: we may want to cache this object. Its generation may take some time if there is a big user base |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Isn't it the time to do it ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Instead of caching we may explore the list generation in the database
https://docs.sqlalchemy.org/en/20/core/functions.html#sqlalchemy.sql.functions.concat
122c315
to
a9c42e7
Compare
a6f73f5
to
eb7e4e5
Compare
eb7e4e5
to
ed9de87
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice!
ed9de87
to
cb09030
Compare
Description
Use jaro_winkler similarity from jellyfish module instead of rapidfuzz. It has the same speed and better results