-
Notifications
You must be signed in to change notification settings - Fork 82
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Croissant tag missing in some Croissant supported datasets #3135
Comments
allenai/c4 has this error when listing the compatible libraries:
This error happens in If we manage to fix, it will show |
For example the following dataset:
https://huggingface.co/datasets/allenai/c4
Lacks a Croissant tag, not just in the UI but also if filtering by "library:mlcroissant" with the API. However, the Croissant file is available in the API:
https://huggingface.co/api/datasets/allenai/c4/croissant
When looking at the 15k most download HF datasets, around 4k were lacking this tag. Sometimes this might be justified due to a faulty DatasetInfo, but that's not always the case as we have seen with allenai/c4.
fyi @lhoestq
The text was updated successfully, but these errors were encountered: