Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

remove "gated datasets unlock" logic #189

Merged
merged 5 commits into from
Apr 1, 2022
Merged

remove "gated datasets unlock" logic #189

merged 5 commits into from
Apr 1, 2022

Conversation

severo
Copy link
Collaborator

@severo severo commented Mar 30, 2022

the new tokens are not related to a user and have read access to the gated datasets (right, @SBrandeis?).

also: add two tests to ensure the gated datasets can be accessed

also: add two tests to ensure the gated datasets can be accessed
@severo severo requested a review from SBrandeis March 30, 2022 14:48
@severo severo marked this pull request as draft March 30, 2022 15:58
@severo severo removed the request for review from SBrandeis March 30, 2022 15:58
I changed
(https://huggingface.co/datasets/severo/dummy_gated/commit/99194748bed3625a941aaf785740df02ca5762c9)
severo/dummy_gated to a simpler dataset, without a python script, to
avoid having non-related errors. Also in the commit: load the HF_TOKEN
from a secret in
https://github.com/huggingface/datasets-preview-backend/settings/secrets/actions
to be able to run the unit tests
@severo severo requested a review from SBrandeis March 30, 2022 17:19
@severo
Copy link
Collaborator Author

severo commented Mar 30, 2022

@SBrandeis I added you as a reviewer because it's part of the migration to the new tokens. I use it only to "unlock" the gated datasets (and later on, authenticate so that the server knows the dataset is unlocked). We don't access private datasets for now, so not related to public/private for now.

@severo severo marked this pull request as ready for review March 30, 2022 17:23
@severo
Copy link
Collaborator Author

severo commented Mar 30, 2022

By the way: what is the relationship between the token you gave me for dataset-preview-backend, and the gated datasets? Is the gated dataset unlocked for the token? for the user or org related to the token?

@severo
Copy link
Collaborator Author

severo commented Mar 30, 2022

unit tests should be failing due to #188

@severo severo changed the title refactor: 💡 move gated datasets "unlock" code to models/ remove "gated datasets unlock" logic Apr 1, 2022
severo added 2 commits April 1, 2022 14:54
it's a dependency of lm-dataformat, and last version still depends on a
vulnerable ujson version
the new "app" tokens on moonlanding can read the gated datasets without
having to accept the conditions first, as it occurs for users.

BREAKING CHANGE: 🧨 HF_TOKEN must be an app token
@SBrandeis
Copy link

Tests are failing tho

@severo
Copy link
Collaborator Author

severo commented Apr 1, 2022

@severo severo merged commit 1a6eb0c into main Apr 1, 2022
@severo severo deleted the test-gated branch April 1, 2022 16:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants