
Merge OpenAI Triton commit f436c9e #3124

Merged 5 commits from whitneywhtsang/merge into main on Jan 9, 2025

Conversation

whitneywhtsang
Contributor

This PR changes the Triton base from 51dddd3 to f436c9e (Jan 8).
Pass rate: 99.86%

Please do not squash and merge this PR.

lezcano and others added 4 commits January 8, 2025 14:22
It was reported that Triton compilation times have increased heavily lately. The cause is that we very often create the associated LinearLayout to check properties of a given Layout. We do this thousands of times, and it gets very expensive.

In this PR, we implement a thread-safe cache for LinearLayouts. We clear this cache after we are done with the TTGIR -> LLVM conversion.

In the future, we will make `DistributedEncoding` inherit from
`LinearLayoutEncoding`, which will mean that `DistributedEncoding`s
will always have access to their associated LinearLayout. Even in this
scenario I still think that caching will be good, as there is no real
1-to-1 correspondence between `DistributedEncoding`s and `LinearLayout`s
due to broadcasting, where we tile a layout along the tensor or we make
it smaller. As such, I think this cache may also be useful in the
future.
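
The pattern is essentially memoization behind a lock, dropped once the lowering pipeline no longer needs it. Below is a minimal illustrative sketch of that pattern in Python; the real cache lives in Triton's C++ compiler, and the class and method names here are hypothetical.

```python
# Illustrative sketch only: a thread-safe memoization cache that is cleared
# once a compilation phase (here, TTGIR -> LLVM conversion) has finished.
# The names below are hypothetical and do not mirror Triton's C++ code.
import threading


class LinearLayoutCache:
    def __init__(self):
        self._lock = threading.Lock()
        self._cache = {}

    def get_or_create(self, key, create_fn):
        # Fast path: return the layout if we already converted it.
        with self._lock:
            cached = self._cache.get(key)
        if cached is not None:
            return cached
        # Slow path: build the value outside the lock, then publish it.
        value = create_fn()
        with self._lock:
            return self._cache.setdefault(key, value)

    def clear(self):
        # Called after the TTGIR -> LLVM conversion so the cache does not
        # outlive the compilation it was built for.
        with self._lock:
            self._cache.clear()
```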

Currently, torch is required for importing triton and performing
autotuning. This seems like a relatively heavy runtime dependency in the
context of the cpu backend, as numpy can easily be used instead.

Opening here as suggested in
triton-lang/triton-cpu#205 to minimize future
merge conflicts.

Ideally there would be a test for this, but with the cpu backend
out-of-tree this seems hard to test.

See also triton-lang/triton-cpu#204,
triton-lang/triton-cpu#205.
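
For illustration only, a torch-free benchmarking helper for a CPU backend could look like the sketch below; the names `cpu_do_bench` and `flush_cache` are hypothetical, and this is not the code in this PR.

```python
# Hypothetical sketch: time a callable on CPU using numpy and perf_counter
# instead of torch, flushing a large buffer between runs so each measurement
# starts from a comparable cache state.
import time

import numpy as np


def flush_cache(size_bytes=256 * 1024 * 1024):
    buf = np.zeros(size_bytes // 8, dtype=np.float64)
    buf += 1.0
    return buf.sum()  # keep the write from being optimized away


def cpu_do_bench(fn, warmup=3, rep=10):
    for _ in range(warmup):  # warm up JIT compilation, allocator, etc.
        fn()
    times = []
    for _ in range(rep):
        flush_cache()
        start = time.perf_counter()
        fn()
        times.append(time.perf_counter() - start)
    return float(np.median(times)) * 1e3  # median latency in milliseconds
```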

# New contributor declaration
- [x] I am not making a trivial change, such as fixing a typo in a comment.

- [x] I have written a PR description following these
  [rules](https://cbea.ms/git-commit/#why-not-how).

- [x] I have run `pre-commit run --from-ref origin/main --to-ref HEAD`.

- Select one of the following.
  - [ ] I have added tests.
    - `/test` for `lit` tests
    - `/unittest` for C++ tests
    - `/python/test` for end-to-end tests
  - [x] This PR does not need a test because it is not (currently) easy to test, and the basic functionality should be covered by existing tests.

- Select one of the following.
  - [x] I have not added any `lit` tests.
  - [ ] The `lit` tests I have added follow these [best practices](https://mlir.llvm.org/getting_started/TestingGuide/#filecheck-best-practices), including the "tests should be minimal" section. (Usually running Python code and using the instructions it generates is not minimal.)
Unify the lowering path of `local_alloc` with that of `local_store` for the case where the shared memory layout has a `leadingOffset`.
whitneywhtsang self-assigned this Jan 9, 2025
whitneywhtsang marked this pull request as ready for review January 9, 2025 18:11
whitneywhtsang merged commit a15b458 into main Jan 9, 2025
5 of 6 checks passed
whitneywhtsang deleted the whitneywhtsang/merge branch January 9, 2025 18:26