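I added an eval_dataset to the trainer and wrote a compute_metric function to calculate some metrics during eval, such as precision/recall for function calling and the BLEU score of the reply text.
The problem: memory usage explodes during evaluate. Training used 10+ GB of GPU memory, but during eval it suddenly jumped to 60+ GB and eventually ran out of memory (OOM).
Have you run into anything similar?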
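For reference, here is a minimal sketch of the function-calling precision/recall part of such a metric. It is not the code from this issue. The helper name `function_call_precision_recall` and the representation of each sample as a set of called function names are assumptions made for illustration, and BLEU is left out.

```python
from typing import Dict, List, Set


def function_call_precision_recall(
    predicted: List[Set[str]], reference: List[Set[str]]
) -> Dict[str, float]:
    """Micro-averaged precision/recall over per-sample sets of function names.

    Hypothetical helper: assumes the function calls have already been parsed
    out of the decoded model output and out of the reference text.
    """
    true_positives = 0
    predicted_total = 0
    reference_total = 0
    for pred_calls, ref_calls in zip(predicted, reference):
        true_positives += len(pred_calls & ref_calls)  # calls present in both
        predicted_total += len(pred_calls)
        reference_total += len(ref_calls)
    # Guard against division by zero when no calls were predicted or expected.
    precision = true_positives / predicted_total if predicted_total else 0.0
    recall = true_positives / reference_total if reference_total else 0.0
    return {"precision": precision, "recall": recall}


# Made-up example data, only to show the call shape.
scores = function_call_precision_recall(
    predicted=[{"get_weather"}, {"search", "book_flight"}],
    reference=[{"get_weather"}, {"search"}],
)
print(scores)
```

A metric like this only needs the decoded function names, not the raw model outputs, which matters for how much data has to be kept around during eval.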
Probably the LoRA weights were not loaded during eval.
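That would still be odd, since loading LoRA shouldn't take much GPU memory. The same problem also occurs with pt2: eval runs out of GPU memory there as well.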
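One explanation that would fit these numbers, though nothing in this thread confirms it: assuming the trainer is the Hugging Face `Trainer` and evaluation returns full logits, setting `compute_metrics` makes it keep the logits of every eval batch until evaluation finishes, and they stay on the GPU unless `eval_accumulation_steps` is set. The rough estimate below shows how quickly that adds up. The sample count, sequence length, and vocabulary size are hypothetical values, not taken from this issue.

```python
def accumulated_logits_gib(
    num_samples: int, seq_len: int, vocab_size: int, bytes_per_value: int = 4
) -> float:
    """Size in GiB of a [num_samples, seq_len, vocab_size] tensor."""
    return num_samples * seq_len * vocab_size * bytes_per_value / 1024**3


# Hypothetical eval set: 250 samples, 1024 tokens each, 65024-entry vocabulary.
full_logits = accumulated_logits_gib(250, 1024, 65024)  # float32 logits
token_ids = accumulated_logits_gib(250, 1024, 1, bytes_per_value=8)  # int64 ids

print(f"full logits: {full_logits:.2f} GiB")
print(f"token ids only: {token_ids:.4f} GiB")
```

If this is the cause, two `Trainer` options may help: `eval_accumulation_steps` in `TrainingArguments` moves accumulated predictions to the CPU every few steps, and the `preprocess_logits_for_metrics` argument lets you reduce the logits (for example with an argmax to token ids) before they are stored.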