Skip to content

A Python script for parsing the names of UK honours recipients

Notifications You must be signed in to change notification settings

GrantQuick/HonoursListParser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

HonoursListParser

A Python script for parsing the names of UK honours recipients.

In the csv files hosted by gov.uk, the names of individuals in receipt of awards are all listed in a single column which makes cross-referencing against existing databases tricky. The script aims to separate out the prefixes, first names, middle names, surnames and suffixes into the appropriate fields of a destination csv, for easy matching against, or inport into, existing database systems.

Getting Started

UK honours lists are available from gov.uk.

Running the script

  1. Update the value of source_file_full_path to point to a csv file obtained from gov.uk containing a list of honours recipients
  2. Define the output file by updating the value of output_file_full_path
  3. Run the script

Known Issues

The script is unlikely to be comprehensive in that it may not correctly identify all possible suffixes and prefixes, and therefore produce invalid data for some individuals, but it will be accurate for the vast number of honours recipients. Additionally, modifications to the values in the suffixes and prefixes lists should resolve any problem records.

Authors

About

A Python script for parsing the names of UK honours recipients

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages