-
Notifications
You must be signed in to change notification settings - Fork 7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Access to torchvision models training files? #615
Comments
I also think that it would be great if the training procedure was part of the repository. Here are a few advantages I see:
|
the training file is https://github.com/pytorch/examples/tree/master/imagenet |
@soumith thanks for the info, this is used for all datasets (including cifar10 cifar100) without changing any default values for those datasets? |
@arnaghizadeh we dont provide cifar10/100 pre-trained models |
@soumith oh I see based on your comment in the code "If True, returns a model pre-trained on ImageNet" the option is only for imagenet. However, your documentation I think should be a little more clear and emphasize that this feature is exclusive for imagenet, I automatically supposed that this feature supports all major datasets. https://pytorch.org/docs/0.4.0/torchvision/models.html?highlight=densenet. |
@sotte I agree with you that all models should have a clearly specified training procedure. Any suggestions on how to include this training information is more than welcome |
@fmassa Great, I'm very happy to see #645! Option 1 Option 2 Independent on the two options, https://pytorch.org/docs/stable/torchvision/models.html should mention how models are trained. I can submit a PR for this if you want and we decided on how to proceed with the training procedure setup. |
@sotte I totally agree with you, and I actually think that option 2 will be the way to go. It would probably involve having a separate repo which would contain all the training logic, with a set of configuration files (maybe à-la https://github.com/facebookresearch/maskrcnn-benchmark/tree/master/configs) that entirely specify how to train a model. This way, we have a simple and reproducible way of obtaining the models. cc @bermanmaxim @soumith for feedback |
@fmassa I agree, in fact I was myself recently wondering about the training specifications of the pretrained models. I think following |
Can we do anything to help out and speed up the process? |
Hi @sotte Sorry for the delay in replying, I was pretty busy with other projects. Yes, having some help would be awesome! I discussed with @soumith about this some time ago, and he mentioned that the best would be to have, for each model:
For most models, it all boils down to So, if someone could start organizing such a structure, potentially in a And this would as well open the door to adding new tasks into torchvision, which is currently very classification-based. Thoughts? |
@fmassa: If you can provide the actual scripts used, it would be easier to refactor into components/scripts suitable for the torchvision repo. |
@Froskekongen for now, most of the trainings have been done with a variant of https://github.com/pytorch/examples/blob/master/imagenet/main.py |
We now provide reference training scripts for classification, detection and segmentation under the |
Hi, In the torchvision models https://github.com/pytorch/vision/tree/master/torchvision/models, we can see the models, we also have the option to download pretrained models, however, what is missing (if I'm not mistaken) is the training files that uses those models. I would really want to have access to those files (specially densenet) so I can reproduce pretrained models myself. Is it possible to have access to these files or at least the exact configurations(e.g, learning rates, epochs, etc) of the training files?
The text was updated successfully, but these errors were encountered: