
[AMP] refine AMP and the corresponding tests for bfloat16 #12787

Merged
3 commits merged on Oct 27, 2022

Conversation

@yangulei (Contributor) commented on Sep 15, 2022

This PR fixes issue #12763, where some ops are marked to keep their original dtype, yet one of their inputs is bfloat16 and the required Cast is missing.
The AMP tests have also been refined to cover bfloat16, without accuracy checking.
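For context, here is a minimal sketch (not taken from this PR) of how the bfloat16 conversion can be exercised through the public AMP pass; the graph, the shapes, and the choice of nn.softmax as an op that keeps its original dtype are illustrative assumptions:

  import tvm
  from tvm import relay

  # Build a tiny fp32 graph: dense (typically converted to bf16 by AMP)
  # followed by softmax (assumed here to be kept in its original dtype).
  x = relay.var("x", shape=(1, 16), dtype="float32")
  w = relay.var("w", shape=(8, 16), dtype="float32")
  y = relay.nn.softmax(relay.nn.dense(x, w))
  mod = tvm.IRModule.from_expr(relay.Function([x, w], y))
  mod = relay.transform.InferType()(mod)

  # Run AMP targeting bfloat16. With the fix, ops that keep fp32 should
  # receive an explicit Cast when they are fed bf16 values.
  mod = relay.transform.ToMixedPrecision("bfloat16")(mod)
  print(mod)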

Update:
The accuracy check between bf16 and fp32 results in test_dnnl.py is unstable and error-prone, so the accuracy check is now skipped when only a single bf16 result is present, i.e. bf16 results are only compared against bf16 and fp32 against fp32.
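A rough illustration (not the actual test_dnnl.py code) of that comparison policy; the helper name and tolerances are chosen here only for the sketch:

  import tvm.testing

  def check_results(results, dtypes, rtol=1e-5, atol=1e-5):
      """Compare only results produced with the same dtype."""
      by_dtype = {}
      for res, dt in zip(results, dtypes):
          by_dtype.setdefault(dt, []).append(res)
      for group in by_dtype.values():
          if len(group) < 2:
              continue  # e.g. a lone bf16 result: accuracy checking is skipped
          for other in group[1:]:
              tvm.testing.assert_allclose(group[0], other, rtol=rtol, atol=atol)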

@billishyahao (Contributor):
@tvm-bot rerun

@billishyahao (Contributor):
Thanks for the patch, Youlei! I found a bunch of statements like "op->dtype.is_float() || op->dtype.is_bfloat16()" in the tvm folder.

Shall we simply add a new float type predicate in tvm/include/tvm/runtime/data_type.h to eliminate those statements?

  /*! \return whether type is a general float type, including float/float16/bfloat16. */
  bool is_general_float() const { return is_float() || is_bfloat16(); }

@yangulei (Contributor, Author):
@billishyahao
I agree that is_general_float() could make the code cleaner, but not clearer. "General float" is a broader concept than IEEE floating point plus bfloat16; for example, TensorFloat-32 is also a general float. I prefer to keep expressions like op->dtype.is_float() || op->dtype.is_bfloat16(), as they are clearer and more specific.

@areusch added and then removed the needs-triage label on Oct 19, 2022
@yangulei (Contributor, Author):
@masahi Could you help review this? Thanks.

@masahi merged commit 5c9066d into apache:main on Oct 27, 2022
xinetzone pushed a commit to daobook/tvm that referenced this pull request Nov 10, 2022
* refine AMP for bfloat16

* refine AMP tests to cover bfloat16

* refine accuracy checking for dnnl bf16
xinetzone pushed a commit to daobook/tvm that referenced this pull request Nov 25, 2022
* refine AMP for bfloat16

* refine AMP tests to cover bfloat16

* refine accuracy checking for dnnl bf16