Script for retrieving updated MARC authority records from external sources.
For Python 3 dependencies, see requirements.txt. You may need to install additional dependencies for lxml; see the official lxml documentation.
Run python3 main.py -h for instructions.
- There are two supported output formats (MARC and MARCXML).
- Three sources of authority data are supported: iDAI.gazetteer, the Library of Congress, and iDAI.thesauri.
- There are three ways to specify the date marking the oldest updates you are interested in:
  - Specify a date in ISO format (YYYY-MM-DD).
  - Specify an offset in days, counted back from the current date.
  - Continue from the last day the script was executed (after each run, this date is stored in a log file in the output directory).
- Alternatively, you can run the script without specifying a start date, creating a complete data dump.
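
The start-date options above could be resolved roughly as in the following sketch. It is an illustration only, not the script's actual code: the function name `resolve_start_date`, its parameters, the precedence between the options, and the assumption that the log file contains nothing but a single ISO date are all hypothetical. Run python3 main.py -h for the real options.

```python
from datetime import date, timedelta
from pathlib import Path
from typing import Optional


def resolve_start_date(
    iso_date: Optional[str] = None,
    days_offset: Optional[int] = None,
    log_file: Optional[Path] = None,
    today: Optional[date] = None,
) -> Optional[date]:
    """Return the oldest update date of interest, or None for a complete dump.

    Hypothetical helper: names and precedence are assumptions for illustration.
    """
    today = today or date.today()

    # Option 1: an explicit date in ISO format (YYYY-MM-DD).
    if iso_date is not None:
        return date.fromisoformat(iso_date)

    # Option 2: a number of days counted back from the current date.
    if days_offset is not None:
        return today - timedelta(days=days_offset)

    # Option 3: continue from the date stored by the previous run.
    # Assumption: the log file holds a single ISO date and nothing else.
    if log_file is not None and log_file.exists():
        return date.fromisoformat(log_file.read_text().strip())

    # No start date given (or no usable log file): request everything.
    return None
```

For example, `resolve_start_date(days_offset=7)` would yield the date one week before today, while calling it with no arguments returns `None`, which corresponds to the complete data dump.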