fix test_tipc/train logic #2879
tests/test_tipc/train.py
Outdated
sys.exit()
#if global_step > 100 and global_step < 110:
# paddle.fluid.core.nvprof_nvtx_pop()
# paddle.fluid.core.nvprof_nvtx_push(str(global_step))
Please delete this.
This commit is only a draft; it stops after running 110 steps.
tests/test_tipc/train.py
Outdated
reader_cost_avg.record(train_reader_cost)
batch_cost_avg.record(train_batch_cost)
batch_ips_avg.record(train_batch_cost, sample_per_cards)
batch_start = time.time()
Why can't these two blocks of code be reused?
For example, line 151 needs the data in batch_cost_avg, so a block like this is also needed before line 145.
But this code appears in both the if... and else... branches. Can't this logic be shared?
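One possible way to share the logic, as a rough sketch only: move the three `record` calls into a helper and call it from both branches. The `Averager` class, the `record_batch_stats` helper, the `meters` dict, and the branch condition are all hypothetical stand-ins, not the actual code in tests/test_tipc/train.py; only the three `record(...)` call shapes are taken from the diff.

```python
class Averager:
    """Hypothetical stand-in for the *_avg meters used in train.py."""

    def __init__(self):
        self.total_time = 0.0
        self.count = 0
        self.total_samples = 0

    def record(self, cost, num_samples=0):
        # Accumulate one measurement (and optionally a sample count).
        self.total_time += cost
        self.count += 1
        self.total_samples += num_samples

    def average(self):
        return self.total_time / self.count if self.count else 0.0

    def ips(self):
        # Samples per second over everything recorded so far.
        return self.total_samples / self.total_time if self.total_time else 0.0


def record_batch_stats(meters, reader_cost, batch_cost, samples_per_card):
    """Shared bookkeeping, callable from both the if and the else branch."""
    meters["reader_cost"].record(reader_cost)
    meters["batch_cost"].record(batch_cost)
    meters["batch_ips"].record(batch_cost, samples_per_card)


meters = {name: Averager() for name in ("reader_cost", "batch_cost", "batch_ips")}
for step in range(4):
    if step % 2 == 0:  # hypothetical branch condition
        record_batch_stats(meters, 0.1, 0.5, 32)
    else:
        record_batch_stats(meters, 0.1, 0.5, 32)
```

With a helper like this, each branch keeps its own control flow but the timing bookkeeping lives in one place.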
Could batch_start also be moved to the end of the batch? In principle the timestamp would be more accurate that way.
batch_start = time.time() has been moved to the end of the batch.
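For reference, a minimal sketch of what resetting the timestamp at the end of the batch can look like. The function `timed_loop`, its `train_step` callback, and the injectable `clock` parameter are assumptions made for illustration; the real loop in train.py may be structured differently.

```python
import time


def timed_loop(batches, train_step, clock=time.time):
    """Collect per-batch reader cost and batch cost.

    batch_start is reset as the very last statement of each iteration,
    so work done after the measurements (for example logging) is not
    charged to the next batch's reader cost.
    """
    reader_costs, batch_costs = [], []
    batch_start = clock()
    for batch in batches:
        # Time spent waiting for the data loader to yield this batch.
        reader_costs.append(clock() - batch_start)
        train_step(batch)
        # Total time for this batch: data loading plus the training step.
        batch_costs.append(clock() - batch_start)
        # ... logging or other per-step work would go here ...
        batch_start = clock()  # reset at the end of the batch
    return reader_costs, batch_costs


# Usage with the real clock and a no-op training step.
reader_costs, batch_costs = timed_loop(range(3), lambda batch: None)
```

The `clock` parameter exists only so the timing can be checked deterministically; in normal use the default `time.time` is enough.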
LGTM. Thanks.
PR types
Others
PR changes
Others
Description
Fix the ips count for XLNet.