Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CUDA driver error: invalid argument #53

Open
znavidi opened this issue Jan 27, 2025 · 0 comments
Open

CUDA driver error: invalid argument #53

znavidi opened this issue Jan 27, 2025 · 0 comments

Comments

@znavidi
Copy link

znavidi commented Jan 27, 2025

Thank you for your great work!
I have been trying to fine-tune the 2B model on the toy dataset just to run the fine-tuning script properly and then adapt that to our project. I keep getting CUDA driver error: invalid argument error during runtime and even though I checked to have compatible torch (2.5.1) and cuda (tried 11.8, 12.1, and 12.4), it keeps getting this error.

python 3.12.0

torch version 2.5.1+cu121

flash-attn version 2.6.3

I appreciate any hint on where the issue might come from and how I can fix it. Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant