Comparing first name using the nicknames package #2206
Ahosseinzadeh723
started this conversation in
Ideas
-
Alternatively, the list of names available at nicknames can be used independently.
-
I'm currently facing a challenge in linking individual records due to the use of nicknames (in member_first_name). I found a Python package called nicknames, which includes a database of over a thousand nicknames. I'm considering integrating this package into my data linkage process to improve the accuracy of individual matching.
Could anyone share their experience or suggest best practices for incorporating such a package into data linkage settings? Specifically, I am looking for advice on how to use this nickname database effectively in my settings.
For example, "Bill" is short for "William". Suppose record 1 has member_first_name Bill and record 2 has member_first_name William.
Looking up the nicknames of "William" in the package returns: {'bela', 'bell', 'bill', 'billy', 'wil', 'will', 'willie', 'willy'}
Using this package, I expect the algorithm to treat member_first_name in record 1 and record 2 as an exact match.
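To make the expected behaviour concrete, here is a minimal sketch of a nickname-aware comparison. The `NICKNAMES` table is a hand-written stand-in that contains only the set quoted above for "William"; the helper names `canonical_forms` and `first_names_match` are hypothetical and not part of the nicknames package or of any linkage library.

```python
# Stand-in lookup table (assumption): canonical name -> known nicknames.
# Only the "William" set quoted in this post is included; a real pipeline
# would load the full data from the nicknames package instead.
NICKNAMES = {
    "william": {"bela", "bell", "bill", "billy", "wil", "will", "willie", "willy"},
}


def canonical_forms(name):
    """Return the normalised name plus every canonical name it is a nickname of."""
    key = name.strip().lower()
    forms = {key}
    for canonical, nicks in NICKNAMES.items():
        if key in nicks:
            forms.add(canonical)
    return forms


def first_names_match(a, b):
    """Treat two first names as a match if they share at least one canonical form."""
    return bool(canonical_forms(a) & canonical_forms(b))


print(first_names_match("Bill", "William"))  # record 1 vs record 2
print(first_names_match("Bill", "Willy"))    # two nicknames of the same name
print(first_names_match("Bill", "Robert"))   # unrelated names
```

One possible design choice is to apply this kind of normalisation as a preprocessing step, so that the linkage comparison itself can remain a plain exact match on the normalised value.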
Any insights or examples of similar implementations would be greatly appreciated!