Gpt 2 sweep #140
Codecov Report
All modified and coverable lines are covered by tests ✅

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #140   +/-   ##
=======================================
  Coverage   72.21%   72.21%
=======================================
  Files          17       17
  Lines        1814     1814
  Branches      295      295
=======================================
  Hits         1310     1310
  Misses        432      432
  Partials       72       72

☔ View full report in Codecov by Sentry.
looks good! maybe i should make this the default for the ansible example run? (@jbloomAus)
also, was 200_00 intentional for training steps? See line 23 of scripts/sweep-gpt2.py in af892ae.
|
Whoops that’s a typo!
|
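The literal questioned above is easy to misread because Python ignores underscores in numeric literals and does not check that they group digits in threes. The sketch below shows what `200_00` evaluates to. The helper `has_regular_grouping` is hypothetical, written only for this illustration, and is not part of SAELens.

```python
import re

# Python accepts underscores anywhere between digits and simply ignores them,
# so an irregular grouping is not an error.
training_steps = 200_00  # the literal questioned in the review


def has_regular_grouping(literal: str) -> bool:
    """Hypothetical helper: True if the literal has no underscores,
    or if its underscores split the digits into groups of three."""
    return re.fullmatch(r"\d+|\d{1,3}(_\d{3})+", literal) is not None


print(training_steps)                   # 20000
print(has_regular_grouping("200_00"))   # False
print(has_regular_grouping("200_000"))  # True
```

A check like this could run over config literals in review, but the thread does not say which value was intended, so the example only flags the irregular grouping.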
@tomMcGrath will know better if this is a reasonable example run. My guess is not, because if we run this a lot, parallelizing by a large number will use more compute.
* sweep settings for gpt2-small
* get model string right
* fix some comments that don't apply now
* formatting fix
Description
Adds base sweep settings for gpt-2-small. Sweeps are intentionally relatively short: most of the action that tells us whether a sweep will be obviously bad seems to happen relatively quickly. Based on Anthropic SAE replication.
W&B dashboard for this run.
Type of change
Please delete options that are not relevant.
Checklist:
You have tested formatting, typing and unit tests (acceptance tests not currently in use)
Run `make check-ci` to check format and linting. (You can run `make format` to format code if needed.)