ukernels: sub-8-bit support, types `s16 * u4 -> s32` and `s16 * s16 -> s32` #15343

bjacob · 2023-10-30T19:56:43Z

`s16 * u4 -> s32` is the main goal in #15158, while `s16 * s16 -> s32` is a less-optimized but broadly useful fallback path for running non-8-bit quantized matmuls that we don't have specialized code for.
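For readers unfamiliar with the notation, `s16 * u4 -> s32` means signed 16-bit LHS elements multiplied by unsigned 4-bit RHS elements and accumulated into signed 32-bit. A minimal reference sketch of that arithmetic follows. The function name `example_dot_s16u4s32` is hypothetical, and the packing convention (two u4 values per byte, even index in the low nibble) is an assumption made for illustration, not something stated in this PR.

```c
#include <stddef.h>
#include <stdint.h>

// Hypothetical reference helper, not taken from the PR.
// Assumption: two u4 values are packed per byte, with the even-indexed
// element in the low nibble and the odd-indexed element in the high nibble.
int32_t example_dot_s16u4s32(const int16_t* lhs, const uint8_t* rhs_packed,
                             size_t k) {
  int32_t acc = 0;
  for (size_t i = 0; i < k; ++i) {
    uint8_t byte = rhs_packed[i / 2];
    // Extract the 4-bit value for element i.
    uint8_t u4 = (i % 2 == 0) ? (uint8_t)(byte & 0x0F) : (uint8_t)(byte >> 4);
    // Widen both operands to 32 bits before multiplying and accumulating.
    acc += (int32_t)lhs[i] * (int32_t)u4;
  }
  return acc;
}
```

An actual mmt4d tile function would compute many such dot products at once over a tile; this only shows the element types involved.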

hanhanW

I'm not familiar with the structure but the pieces look good to me.

hanhanW · 2023-11-01T01:38:27Z

runtime/src/iree/builtins/ukernel/arch/arm_64/mmt4d_arm_64_entry_point.c

+    case iree_uk_mmt4d_type_s16s16s32:
+      return 0;
+    case iree_uk_mmt4d_type_s16u4s32:
+      return 0;


I'm not familiar with these. It looks like it is returning a null function pointer. Should we add a TODO?

Yes, this returns a null function pointer, which in this context means "we do not have an arm64-optimized code path for this". The caller sees the null result and falls back to generic code.
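The fallback pattern described here can be sketched roughly as follows. All names (`example_type_t`, `example_select_arch_tile_func`, and so on) are hypothetical and the tile function signature is simplified to a dot product; this is not the actual ukernel API, only an illustration of "null means no optimized path, use generic code".

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

// Hypothetical names throughout; this is not the actual IREE ukernel API.
typedef enum {
  EXAMPLE_TYPE_OTHER,
  EXAMPLE_TYPE_S16S16S32,
  EXAMPLE_TYPE_S16U4S32,
} example_type_t;

typedef int32_t (*example_tile_func_t)(const int16_t* lhs, const int16_t* rhs,
                                       size_t k);

// Generic code path: always available, works for every type in this sketch.
int32_t example_generic_tile_func(const int16_t* lhs, const int16_t* rhs,
                                  size_t k) {
  int32_t acc = 0;
  for (size_t i = 0; i < k; ++i) acc += (int32_t)lhs[i] * (int32_t)rhs[i];
  return acc;
}

// Stand-in for an architecture-optimized path (same result, for illustration).
int32_t example_optimized_tile_func(const int16_t* lhs, const int16_t* rhs,
                                    size_t k) {
  return example_generic_tile_func(lhs, rhs, k);
}

// Architecture-specific selection: returns NULL (0) when there is no
// optimized code path for the requested type.
example_tile_func_t example_select_arch_tile_func(example_type_t type) {
  switch (type) {
    case EXAMPLE_TYPE_OTHER:
      return example_optimized_tile_func;
    case EXAMPLE_TYPE_S16S16S32:
      return NULL;
    case EXAMPLE_TYPE_S16U4S32:
      return NULL;
    default:
      return NULL;
  }
}

// Caller: try the architecture-specific path first, else fall back to generic.
example_tile_func_t example_select_tile_func(example_type_t type) {
  example_tile_func_t arch_func = example_select_arch_tile_func(type);
  return arch_func ? arch_func : example_generic_tile_func;
}

int32_t example_run_tile(example_type_t type, const int16_t* lhs,
                         const int16_t* rhs, size_t k) {
  example_tile_func_t func = example_select_tile_func(type);
  assert(func != NULL);  // The generic fallback guarantees a usable function.
  return func(lhs, rhs, k);
}
```

With this structure, adding an optimized path for a type later only means replacing the corresponding `return NULL` in the architecture-specific selector; callers are unchanged.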


ukernel-sub8bit

df62241

bjacob changed the title ~~ukernel-sub8bit~~ ukernels: sub-8-bit support, types s16 * u4 -> s32 and s16 * s16 -> s32 Oct 30, 2023

bjacob marked this pull request as ready for review October 30, 2023 21:42

bjacob requested a review from benvanik as a code owner October 30, 2023 21:42

bjacob requested a review from Max191 October 31, 2023 17:24

bjacob mentioned this pull request Oct 31, 2023

optimized s16s16s32 mmt4d tile functions on x86 #15355

Closed

hanhanW approved these changes Nov 1, 2023


bjacob merged commit 3d1d8c8 into iree-org:main Nov 1, 2023
