[AMP] Refine found_inf of loss_scaler #37770

zhangbo9674 · 2021-12-01T12:55:12Z

PR types

Performance optimization

PR changes

APIs

Describe

AmpScaler类用于混合精度训练过程中对loss进行缩放，其中成员属性：_found_inf用于标记每轮训练过程中参数梯度是否存在inf。

原本框架代码会在调用check_finite_and_unscaleop通过to_variable申请两个bool类型的tensor，导致每轮训练在该时间存在cudaMemcpy，影响GPU性能：

优化后，将在AmpScaler类初始化过程中声明并定义两个bool类型的tensor，消除训练过程中的cudaMemcpy：

paddle-bot-old · 2021-12-01T12:55:41Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

zhiqiu

LGTM

refine found_inf of loss_scaler

4a22730

zhiqiu approved these changes Dec 2, 2021

View reviewed changes

zhiqiu merged commit cc2b466 into PaddlePaddle:develop Dec 2, 2021

Zjq9409 pushed a commit to Zjq9409/Paddle that referenced this pull request Dec 10, 2021

refine found_inf of loss_scaler (PaddlePaddle#37770)

507cb48

zhangbo9674 deleted the dev/loss_scaler_found_inf branch March 2, 2023 02:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AMP] Refine found_inf of loss_scaler #37770

[AMP] Refine found_inf of loss_scaler #37770

zhangbo9674 commented Dec 1, 2021

paddle-bot-old bot commented Dec 1, 2021

zhiqiu left a comment

[AMP] Refine found_inf of loss_scaler #37770

[AMP] Refine found_inf of loss_scaler #37770

Conversation

zhangbo9674 commented Dec 1, 2021

PR types

PR changes

Describe

paddle-bot-old bot commented Dec 1, 2021

zhiqiu left a comment

Choose a reason for hiding this comment