
Historical disease database version 1.0.0

@vankesteren vankesteren released this 17 Dec 18:08
0937ede

This is the first stable version of the disease database. It was created by running regex searches for locations (municipalities) and diseases in historical newspaper articles from 1830-1940. Post-processing is minimal, so several issues likely remain. Please use this dataset with care and perform your own quality checks. When in doubt, contact us via GitHub.
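To illustrate the general idea of a regex-based search for locations and diseases, here is a minimal sketch in Python. It is not the project's actual pipeline: the patterns, the function names `find_mentions` and `cooccurrences`, and the example sentence are all assumptions made up for illustration.

```python
import re

# Hypothetical search terms; the real patterns behind the database are not shown in these notes.
LOCATION_PATTERN = re.compile(r"\b(Amsterdam|Utrecht|Groningen)\b", re.IGNORECASE)
DISEASE_PATTERN = re.compile(r"\b(cholera|typhus|pokken)\b", re.IGNORECASE)


def find_mentions(article_text):
    """Return the sets of locations and diseases mentioned in one article."""
    locations = {m.group(1).lower() for m in LOCATION_PATTERN.finditer(article_text)}
    diseases = {m.group(1).lower() for m in DISEASE_PATTERN.finditer(article_text)}
    return locations, diseases


def cooccurrences(article_text):
    """Pair every location with every disease found in the same article."""
    locations, diseases = find_mentions(article_text)
    return sorted((loc, dis) for loc in locations for dis in diseases)


# Made-up example sentence in period-style Dutch.
article = "De cholera heerscht te Utrecht; ook in Amsterdam zijn gevallen gemeld."
print(cooccurrences(article))
```

Because a plain co-occurrence search pairs every location with every disease in the same article, it can link terms that are unrelated in the text, which is one reason to run your own quality checks.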

Improvements planned for upcoming versions include limiting the distance between location and disease mentions in the article text, more advanced text processing, and post-hoc space-time smoothing, among others.
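As a rough illustration of what limiting the location/disease distance could look like, the sketch below only counts a match when the two mentions start within a fixed number of characters of each other. This is an assumption about one possible approach, not the planned implementation; the patterns, the function name `mentions_within`, and the `max_chars` threshold are hypothetical.

```python
import re

# Hypothetical single-term patterns, for illustration only.
LOCATION_PATTERN = re.compile(r"\bUtrecht\b", re.IGNORECASE)
DISEASE_PATTERN = re.compile(r"\bcholera\b", re.IGNORECASE)


def mentions_within(article_text, max_chars=50):
    """Return True if a location and a disease mention start within max_chars of each other."""
    loc_positions = [m.start() for m in LOCATION_PATTERN.finditer(article_text)]
    dis_positions = [m.start() for m in DISEASE_PATTERN.finditer(article_text)]
    return any(abs(loc - dis) <= max_chars
               for loc in loc_positions
               for dis in dis_positions)


# Made-up examples: one with nearby mentions, one with mentions far apart.
near = "De cholera heerscht te Utrecht."
far = "Cholera in het buitenland. " + "x" * 200 + " Markt te Utrecht."
print(mentions_within(near), mentions_within(far))
```

A character window is the simplest choice here; a word or sentence window would be an equally plausible alternative.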

What's Changed

Full Changelog: v0.2.0-alpha...v1.0.0