-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
wikipidea json dump #2
Comments
Hi, the JSON to download is found here: https://dumps.wikimedia.org/wikidatawiki/entities/ |
Thanks ! And is this the xml dump wikidatawiki-20211201-pages-articles-multistream.xml.bz2 (124 GB) |
Or is enwiki-20211201-pages-articles-multistream.xml.bz2 good only for english |
Hi, I would suggest you use the info in the 'naacl' branch: https://github.com/SasCezar/XWikiRE/tree/naacl In the naacl branch, the recommended wikipedia dump is JSON and well. Sorry about the chaos |
hi want to generate wikipidea reading dataset for English. Which specific JSON I should download? And what will be its size after unzipping
The text was updated successfully, but these errors were encountered: