[ONNX] Add MatMulInteger16 contrib op #9186
Conversation
Thanks for helping with the review @cconvey! CC-ing @mbrookhart @anwang2009 @AndrewZhaoLuo
@abhikran-quic do you still plan on working on this?
Hi @AndrewZhaoLuo: I've been out of office for the last two weeks, and hence I haven't been able to reply here. Sorry about this!
No problem, have a good vacation!
@abhikran-quic friendly ping to see if updating this PR is currently blocked, thanks!
Thank you @AndrewZhaoLuo for your help on this. I've incorporated the changes you mentioned in the latest patch. However, there is one more pending issue. After removing test_matmulinteger from unsupported_onnx_tests, I saw an error from onnx.py (here is the link to the CI run where the failure is seen). IMHO the error is expected when test_matmulinteger is removed, because onnx.py has no converter for an op named MatMulInteger. Onnxruntime supports MatMulInteger16 and MatMulIntegerToFloat, so we can add ops under the names given in the onnxruntime documentation; removing test_matmulinteger therefore leads to the error. In my latest patch I've retained test_matmulinteger in unsupported_onnx_tests. Please share your thoughts on this.
Ah yes, MatMulInteger is an ONNX op, but it is separate from MatMulInteger16.
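For context, here is a minimal sketch of how a MatMulInteger16 converter might be wired into TVM's ONNX frontend. The names `OnnxOpConverter`, `infer_type`, and `matmul_out_dtype` are assumed from python/tvm/relay/frontend/onnx.py; the body is illustrative, not the merged code.

```python
# Illustrative sketch, not the merged code: a contrib-op converter in the
# style of python/tvm/relay/frontend/onnx.py. `OnnxOpConverter`,
# `infer_type`, and `matmul_out_dtype` are assumed from that module.
class MatMulInteger16(OnnxOpConverter):
    """Converter for the com.microsoft.MatMulInteger16 contrib op."""

    @classmethod
    def _impl_v10(cls, inputs, attr, params):
        a_dtype = infer_type(inputs[0]).checked_type.dtype
        b_dtype = infer_type(inputs[1]).checked_type.dtype
        # int16/uint16 inputs accumulate into a 32-bit output;
        # only two unsigned inputs yield an unsigned result.
        out_dtype = "int32"
        if a_dtype == "uint16" and b_dtype == "uint16":
            out_dtype = "uint32"
        return matmul_out_dtype(inputs, out_dtype)
```

This is also why removing test_matmulinteger fails: the frontend's convert map would need a separate entry for MatMulInteger itself.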
LGTM
Thank you @AndrewZhaoLuo for your review. I would request the reviewers to please review this PR.
Hello reviewers, could you please review this PR for any further changes needed?
LGTM
Thank you @abhikran-quic, @AndrewZhaoLuo, @cconvey. The PR has been merged.
* [ONNX] Add MatMulInteger16 contrib op
* Fix formatting errors
* Remove a code comment and do not set default value of nd
* Move flatten_to_nd function outside matmul to be used across multiple functions
* Add function docstring and describe the tests
* Use max/min value of int16 as high/low while generating input vectors (see the sketch after this list)
* Converge MatMul and MatMulInteger16 ops into a single op using output dtype
* Fix indentation issues
* Formatting changes
* Fix CUDA batchmatmul strategy to allow mixed precision
* Add test_matmulinteger to unsupported_onnx_tests
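As a rough illustration of the input-generation bullet above (a sketch, not the PR's actual test code, with hypothetical shapes), the full int16 range serves as the low/high bounds:

```python
import numpy as np

# Hypothetical shapes; the PR's tests define their own cases.
i16 = np.iinfo(np.int16)
# Note: np.random.randint's `high` bound is exclusive.
a = np.random.randint(i16.min, i16.max, size=(4, 8), dtype=np.int16)
b = np.random.randint(i16.min, i16.max, size=(8, 3), dtype=np.int16)
# The reference result accumulates in int32 so int16 products do not overflow.
ref = a.astype(np.int32) @ b.astype(np.int32)
```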
@abhikran-quic
Hi @Icemist, this change was done to optimize the code path, and since all the automated tests passed, we went ahead and committed the change.
This PR adds support for the contrib op com.microsoft.MatMulInteger16.
It converges the MatMul and MatMulInteger16 ops to use a single function parameterized by output dtype.
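A minimal sketch of the convergence idea, assuming the helper names from the commit list (`flatten_to_nd`, `matmul_out_dtype`); the real converter also handles broadcasting and higher-rank inputs, which are omitted here.

```python
from tvm import relay

def matmul_out_dtype(inputs, out_dtype):
    """One lowering shared by MatMul (out_dtype="float32") and
    MatMulInteger16 (out_dtype="int32"/"uint32"): the matmul itself is
    identical, only the accumulation dtype differs."""
    a, b = inputs
    # 2-D case only; higher-rank inputs would first go through
    # flatten_to_nd and have their batch dims restored afterwards.
    return relay.nn.matmul(a, b, out_dtype=out_dtype)
```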