Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can not reproduct the result in Table 2. #16

Open
Zyriix opened this issue Nov 12, 2023 · 5 comments
Open

Can not reproduct the result in Table 2. #16

Zyriix opened this issue Nov 12, 2023 · 5 comments

Comments

@Zyriix
Copy link

Zyriix commented Nov 12, 2023

I run the following training command on 8 v100.

torchrun --standalone --nproc_per_node=8 train.py --outdir=training-runs
--data=datasets/cifar10-32x32.zip --cond=0 --arch=ddpmpp

This gives me a FID at 2.08169 which is still far from FID in the paper(1.97).

I think this may caused by the random seeds(Random init in the code).
Is it possible to share the seed for reprodcuting the result in table2?

Any suggestiones?

@Newbeeer
Copy link

I observed the FID of EDM fluctuates largely. Did you report the lowest FID among checkpoints?

@Zyriix
Copy link
Author

Zyriix commented Nov 13, 2023

I observed the FID of EDM fluctuates largely. Did you report the lowest FID among checkpoints?

I found this in some papers that try to retrain Edm. I got the lowest FID(2.04053) at around 16w ckpt.
I think the fluctuation is caused by lower ema rate and larger learning rate.

@yuanzhi-zhu
Copy link

@Zyriix To calculate FID we have to first generate, saying 50000 samples. The random seed can influence the generation process of these samples hence the FID value I guess

@Zyriix
Copy link
Author

Zyriix commented Nov 18, 2023

@Zyriix To calculate FID we have to first generate, saying 50000 samples. The random seed can influence the generation process of these samples hence the FID value I guess

Most works use the same sample seeds just like this repo:
049999, 5000099999, 100000~149999
and report the lowest one.

@Liyuan-Liu
Copy link

--arch=ncsnpp performs much better than VP architecture

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants