forked from apache/tvm
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[ONNX] Add MatMulInteger16 contrib op (apache#9186)
* [ONNX] Add MatMulInteger16 contrib op * Fix formatting errors * Remove a code comment and do not set default value of nd * Move flatten_to_nd function outside matmul to be used across multiple functions * Add function docstring and describe the tests * Use max/min value of int16 as high/low while generating input vectors * Converge MatMul and MatMulInteger16 ops into a single op using output dtype * Fix indentation issues * Formatting changes * Fix CUDA batchmatmul strategy to allow mixed precision * Add test_matmulinteger to unsupported_onnx_tests
- Loading branch information
1 parent
6fa8b30
commit 0258671
Showing
3 changed files
with
133 additions
and
75 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters