Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What is the data group "remove" in the given "d3_with_clash_info.csv" file? #17

Open
resistzzz opened this issue Mar 26, 2024 · 1 comment

Comments

@resistzzz
Copy link

In the given "d3_with_clash_info.csv" file, I find that number of 1072 samples are tagged as "remove" in the "group" column, rather than "train/valid/test". Why these samples are tagged as "remove"? Are these samples used for training?

@Nobody-Zhang
Copy link

I don't think 'removed' files could be used for training.
There could be multiple reasons... For example, one could not be sanitized?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants