Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to set up the adversarial training? #141

Open
Dreamkeeper66666 opened this issue May 5, 2024 · 1 comment
Open

How to set up the adversarial training? #141

Dreamkeeper66666 opened this issue May 5, 2024 · 1 comment

Comments

@Dreamkeeper66666
Copy link

Dreamkeeper66666 commented May 5, 2024

Hi, I just tried to have the cyclic-adv-s545 model play against the latest 28b model. However, it doesn't seem to work very well. And I would like to do some fine tuning on my own. I just saw some scripts under kubernetes folder, but I don't really know how to make it run locally, so is there any instructions on setting up the iterative adversarial training on a local machine? By the way are there some more recent models? Thanks! @AdamGleave @tomtseng @ed1d1a8d

@tomtseng
Copy link
Contributor

Right, I don't expect cyclic-adv-s545m to work well against 28b since KataGo has been adversarially trained against cyclic positions from cyclic-adv-s545m, though I would guess that if you spent several hundred GPU-days fine-tuning that you could get an attack against 28b.

I added some more detailed instructions in the README now and uploaded adversaries to Google Drive. I should note the scale of these experiments is a bit high (several hundred GPU-days) so if you don't have access to several GPUs you might find it hard to make much training progress.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants