Support unpadded shapes in matmul1d w/ gather_in0 #16626
avoraTT added a commit that referenced this issue on Jan 24, 2025:
…6627)

### Ticket
- #16626

### Problem description
In the current use case of Matmul1D with gather_in0 in the Llama models, the activations and weights need to be padded. This results in significant overhead.

### What's changed
- Added support to skip the part of in0_block_w that is padding information
- Pad the Kt and Nt in the host code for gather_in0

### Checklist
- [x] Post commit CI passes (https://github.com/tenstorrent/tt-metal/actions/runs/12893880800)
- [x] New/Existing tests provide coverage for changes (https://github.com/tenstorrent/tt-metal/actions/runs/12893883783)
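As a rough illustration of the first change, each pass over in0 can accumulate only the tiles that fall inside the real (unpadded) Kt and skip the padded tail. This is a minimal Python model, not the actual kernel code; the names `Kt`, `Kt_padded`, and `in0_block_w` follow the ticket, while the loop structure is an assumption:

```python
def k_blocks_to_accumulate(Kt: int, Kt_padded: int, in0_block_w: int) -> list[tuple[int, int]]:
    """Hypothetical model of the kernel-side change: walk K in blocks of
    in0_block_w tiles up to the padded tile count, but record only the
    width of each block that holds real data; pure-padding blocks are
    skipped entirely."""
    work = []
    for block_start in range(0, Kt_padded, in0_block_w):
        # Tiles in this block that lie inside the unpadded Kt
        # (0 when the whole block is padding).
        real_w = max(0, min(in0_block_w, Kt - block_start))
        if real_w > 0:
            work.append((block_start, real_w))
    return work

# Kt=112 padded to 120 with in0_block_w=6: the block starting at 108
# accumulates only 4 of its 6 tiles, and the block at 114 is skipped.
print(k_blocks_to_accumulate(112, 120, 6))
```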
patrickroberts pushed a commit that referenced this issue on Jan 25, 2025.
ubcheema pushed a commit that referenced this issue on Jan 28, 2025.
yieldthought pushed a commit that referenced this issue on Jan 31, 2025.
Problem

Currently, Matmul1D with `gather_in0=True` does not handle shapes (K, N) whose tile counts are not divisible by the number of cores in the ring. In the current use case in the Llama models, the activations and weights must be padded before the matmul can be used, and this padding/slicing causes significant overhead. This matmul should therefore support shapes that are not divisible by the number of cores, and handle the padding implicitly.
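Concretely, handling the padding implicitly means the host rounds the K and N tile counts up to a multiple of the ring size itself, so callers can pass unpadded shapes. A minimal sketch, assuming 32x32 tiles and a hypothetical helper name (the actual tt-metal host code may differ):

```python
import math

TILE = 32  # tile edge length used by tt-metal

def pad_kt_nt(K: int, N: int, ring_size: int) -> tuple[int, int]:
    """Round the K and N tile counts up to the nearest multiple of the
    number of cores in the gather ring."""
    Kt = math.ceil(K / TILE)
    Nt = math.ceil(N / TILE)
    Kt_padded = math.ceil(Kt / ring_size) * ring_size
    Nt_padded = math.ceil(Nt / ring_size) * ring_size
    return Kt_padded, Nt_padded

# Example: K=3584 (Kt=112) and N=2304 (Nt=72) on a 20-core ring
# round up to Kt_padded=120 and Nt_padded=80; the extra tiles are
# padding the kernel skips, instead of the caller materializing them.
print(pad_kt_nt(3584, 2304, 20))  # (120, 80)
```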