Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can't download the dogc vs non-dogs dataset #7

Closed
k4ntz opened this issue Mar 21, 2022 · 4 comments
Closed

Can't download the dogc vs non-dogs dataset #7

k4ntz opened this issue Mar 21, 2022 · 4 comments

Comments

@k4ntz
Copy link

k4ntz commented Mar 21, 2022

Hi. I am trying to reproduce and use your dogs vs non-dogs dataset. I followed the link to download it but the archive file seems to be broken.
I have read this issue, where you explain how you created the dataset.
in it, you declare using 50K images :

  • Dogs-50A-train: 50K images 1000 images from ImageNet Training Set for each 50 classes (I guess, because you wrote 100). Another problem is that for certain classes, you only have ~700 images.
  • Dogs-50A-val: 10K images (so I guess 200 images for each 50 classes). You write once that you take it from ImageNet validation (in the table) and then from the training set. The validation set only contain 50 images per class. I thus think that you might have taken these images from the training set. Am I right?

Would you have a link to a non-broken archive ? Or could you please clear my above concerns such that I can generate an equivalent one ?

@romain-xu-darme
Copy link

Hi. I'm facing the same issue right now. Did you ever get a link to non-broken files? @andrehuang @PKUCSS Not sure if it helps but here are the checksum of the available files
md5sum dogs50A-train.tar.gz 859653679d267b902666c7d629a44a0d
md5sum non-dogs-val.tar.gz 62d737cf91c7909f3d7796e701817a40

@andrehuang
Copy link
Collaborator

Hi, sorry for the late response. Thanks for your interest in our work. The authors of this paper have either left the company or graduated, so it's not watched very promptly.
For the file, it's a bit weird that it cannot be unzipped. But I'm not sure whether we have another copy offline now. @PKUCSS Sishuo, do you have one?
Also, for the creation of the dataset, Sishuo, can you clarify a bit? Thanks!

@PKUCSS
Copy link
Collaborator

PKUCSS commented Jul 26, 2022

Sorry that I don't store a local copy of the dataset or the code for construction after I finished my internship at Megvii, so currently I'm not able to check the details. We will try to recover the dataset and clarify the details if a local copy can be found.

@romain-xu-darme
Copy link

Ok, thanks a lot!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants