This code collects a congressional/parliamentary dataset covering the US, UK, and Canada. The dataset is hosted in the Hugging Face repo https://huggingface.co/datasets/hazylavender/CongressionalDataset
This code collects bioRxiv abstracts released under the licenses 'cc_by_nc_nd', 'cc_by_nd', 'cc_by_nc', 'cc_by', and 'cc0'. The dataset is hosted in the Hugging Face repo https://huggingface.co/datasets/hazylavender/biorxiv-abstract
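
The license restriction above can be illustrated with a minimal sketch. It assumes each abstract record is a dict with a `license` key holding one of the strings listed above. The function name `filter_by_license`, the record layout, and the sample records are hypothetical and are not taken from this repository's code.

```python
# Licenses listed in this README.
ALLOWED_LICENSES = {"cc_by_nc_nd", "cc_by_nd", "cc_by_nc", "cc_by", "cc0"}


def filter_by_license(records, allowed=ALLOWED_LICENSES):
    """Keep only records whose 'license' value is in the allowed set.

    Assumption: each record is a dict with a 'license' key. Values are
    normalized (stripped, lowercased) before comparison.
    """
    kept = []
    for record in records:
        license_name = str(record.get("license", "")).strip().lower()
        if license_name in allowed:
            kept.append(record)
    return kept


# Hypothetical sample records, for illustration only.
sample = [
    {"doi": "10.1101/example.1", "license": "cc_by", "abstract": "..."},
    {"doi": "10.1101/example.2", "license": "cc_no", "abstract": "..."},
    {"doi": "10.1101/example.3", "license": "CC0", "abstract": "..."},
]

kept_dois = [r["doi"] for r in filter_by_license(sample)]
```

Normalizing the license string before the membership check is a design choice in this sketch, so that values such as `CC0` and `cc0` are treated the same; the actual collection code may handle this differently.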