
[RFC] Establish TVM Unity Connection -- A Technical Strategy #91

Closed

Conversation

tqchen
Member

@tqchen tqchen commented Aug 24, 2022

This RFC summarizes the overall technical strategy for the establish-TVM-Unity-connection milestone.
It is supplementary to the technical RFCs about specific components.

rendered
discuss thread

@Hzfengsy Hzfengsy self-assigned this Aug 24, 2022
@tqchen tqchen changed the title Establish TVM Unity Connection -- A Technical Strategy [RFC] Establish TVM Unity Connection -- A Technical Strategy Aug 24, 2022
@Hzfengsy
Member

Thanks @tqchen!!! I'm excited to see the pre-RFC become this formal RFC.

The Unity Connection is a great step from multi-level lowering compilation to a flexible, unified abstraction for end-to-end model compilation. I'd like to summarize the discussion thread here for readers who did not participate.

Modularized Compilation Flow

TVM Unity uses a cross-layer abstraction to represent:

  1. Graph IR: how to organize ops/kernels
  2. Tensor IR/Libraries/FFI: how to execute the ops/kernels

Based on such an abstraction, we can build the module at any stage, as long as the module is legal. In contrast, the current multi-stage lowering pipeline forces a fixed GraphIR -> TensorIR -> RuntimeModule flow. A sketch of a module that mixes the two levels is shown below.
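For illustration, here is a minimal sketch of what such a mixed module could look like in TVMScript. It assumes the unity-branch (Relax) script syntax; the exact `R.call_tir` signature and decorator spellings may differ across versions.

```python
import tvm
from tvm.script import ir as I, relax as R, tir as T


@I.ir_module
class MixedModule:
    # Tensor-level function (TIR): defines how the kernel executes.
    @T.prim_func
    def add_one(A: T.Buffer((16, 16), "float32"), B: T.Buffer((16, 16), "float32")):
        for i, j in T.grid(16, 16):
            with T.block("add_one"):
                vi, vj = T.axis.remap("SS", [i, j])
                B[vi, vj] = A[vi, vj] + T.float32(1)

    # Graph-level function (Relax): organizes the ops/kernels.
    @R.function
    def main(x: R.Tensor((16, 16), "float32")) -> R.Tensor((16, 16), "float32"):
        cls = MixedModule
        # Call straight into the tensor-level kernel above; no fixed
        # GraphIR -> TensorIR -> RuntimeModule lowering order is required.
        y = R.call_tir(cls.add_one, (x,), out_sinfo=R.Tensor((16, 16), "float32"))
        return y
```

Both levels live in a single IRModule, so passes can inspect and rewrite either one at any point in the flow.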

Easy to customize

Fast customization is critical during research and prototyping, and the modularized workflow natively enables it. Here I'd like to share two cases:

Ex1: Adding new operator support

Instead of the 7-step tutorial, only two steps are needed with the unity connection:

  • Implement how the op is computed (either TIR or a library kernel works);
  • Directly call the implementation from the unified abstraction (see the sketch after this list).
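A hedged sketch of these two steps follows, using a library kernel registered as a packed function. The kernel name `my_gelu` and its NumPy body are purely illustrative, and the exact spelling of the destination-passing call (`R.call_dps_packed` here) may differ across unity-branch versions.

```python
import numpy as np
import tvm
from tvm.script import ir as I, relax as R


# Step 1: implement the op. A library/FFI kernel registered as a packed
# function is as valid an implementation as a TIR PrimFunc.
@tvm.register_func("my_gelu")
def my_gelu(x: tvm.nd.NDArray, out: tvm.nd.NDArray):
    a = x.numpy()
    res = 0.5 * a * (1.0 + np.tanh(0.7978845608 * (a + 0.044715 * a ** 3)))
    out.copyfrom(res.astype("float32"))


# Step 2: call the implementation directly from the unified abstraction.
@I.ir_module
class GeluModule:
    @R.function
    def main(x: R.Tensor((8,), "float32")) -> R.Tensor((8,), "float32"):
        # Destination-passing-style call into the registered library kernel.
        y = R.call_dps_packed("my_gelu", (x,), out_sinfo=R.Tensor((8,), "float32"))
        return y
```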

Ex2: Customizable operator fusion

Each pass is decoupled, which means we are able to fuse operators (more generally, optimize the module) across multiple passes. For example, we can have a customized pass that fuses two convs while the built-in fusor fuses the following element-wise ops, as in the sketch below.
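A minimal sketch of such a pipeline, assuming the pass infrastructure and the relax `FuseOps` pass as exposed on the unity branch; the custom pass `MyConvPairFusion` is hypothetical and left as a no-op here.

```python
import tvm
from tvm import relax


# A hypothetical user-defined module pass; a real one would rewrite the
# Relax functions to fuse two adjacent conv ops. Returning the module
# unchanged keeps the sketch minimal.
@tvm.transform.module_pass(opt_level=0, name="MyConvPairFusion")
class MyConvPairFusion:
    def transform_module(self, mod, ctx):
        return mod


# Because each pass is decoupled, custom and built-in fusion can be
# sequenced freely in one pipeline.
pipeline = tvm.transform.Sequential(
    [
        MyConvPairFusion(),         # custom: fuse the two convs
        relax.transform.FuseOps(),  # built-in fusor: fuse the element-wise ops that follow
    ]
)
# fused_mod = pipeline(mod)  # apply to an existing IRModule `mod`
```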

Cross-layer optimization opportunities

Layout optimization is a typical cross-layer optimization that we are able to do with TVM Unity, and we already have prototype results showing that it works. I'm also happy to see that community members working on different backends are all looking forward to this feature.

Note that this RFC is a technical strategy, which is a bit different from the Relax Upstream RFC #89. Please turn to that thread if you have specific comments on Relax itself.

Love to hear ideas from the community.

Member

@zhiics zhiics left a comment


I've already read the pre-RFC in the discussion forum and shared some thoughts there. Here I only have some comments on grammatical nits and typos.

(10 review comments on rfcs/0091-establish-tvm-unity-connection.md, all resolved)

@ZihengJiang ZihengJiang left a comment


Looks great to me, and I don't have further opinions since I have already left my comments in the discussion forum. Excited to see this happening and looking forward to those key components landing.

@tqchen
Member Author

tqchen commented Aug 30, 2022

Opened a voting thread: apache/tvm#12651

@tqchen
Member Author

tqchen commented Aug 25, 2023

Thanks, everyone, for putting effort into making unity development happen.

Today, we come to the one-year mark of the unity connection proposal. It is amazing to see how the landscape of AI/ML has changed and how some of the emerging needs fit right into the strategy we brought forward one year ago. Because of the time delays, the strategy as it stands no longer makes as timely sense as it did one year ago, so we decided to withdraw the proposal and work on coming up with new ones.

It is now obvious that, standing at last year's point, if we had taken the default option of not pursuing or delaying the strategy, we would have missed the opportunity to empower the community in the age of genAI.

Luckily, we did not let that slip away and pushed the development of the unity branch. Many of you may have seen the news lately on how it bears fruit in enabling real applications, such as bringing LLMs and other foundational models to various devices.

The good news is that we have clarified the strategy decision process to empower strategic proposals. There are also new emerging opportunities in the age of genAI and foundational models, which the majority of the community supports.

Please follow the unity branch, continue participating in community discussions, and let us bring TVM to empower everyone. Thank you to the many community members who supported the proposal and believed in what it can bring. Let us do our best to push unity development forward. We will also collect input and come up with new versions of the strategy that reflect it.

@tqchen tqchen closed this Aug 25, 2023
@junrushao
Member

junrushao commented Aug 25, 2023

Thanks, everyone, for the year-long discussion.

I'd love to note that, in retrospect, we would have already missed the boat of generative AI, and Apache TVM would have lost the momentum to empower the community on the up-to-date workloads they are interested in, if we had decided to follow the default option of not pursuing a real solution and to stagnate in endless debates without concrete action taken.

I'm super glad to have a set of decisive PMC members who care and a broad, vibrant community who contribute and help. We work collaboratively on the Unity branch, this super valuable technical direction, catching up with and empowering generative AI together!
