[Relax][Frontend] "tensor_ir_inplace" op #16498

MasterJH5574 · 2024-01-31T14:56:59Z

This PR introduces `tensor_ir_inplace_op` for the frontend so that we can leverage `call_tir_inplace` in the SLM model definition flow.

One unit test is added. This PR also fixes a few typos in type annotations.

slyubomirsky · 2024-02-01T21:27:31Z

Thanks for fixing the typos 🙂 Can't believe I had them there in the first place. Is the main reason to add this op so you could use an inline PrimFunc with it?

MasterJH5574 · 2024-02-02T01:43:37Z

@slyubomirsky You're welcome! The main reason for introducing this op is to use `call_tir_inplace` in line with the principle of our new `nn.Module` interface: we can hide the TIR definition in a file (e.g., https://github.com/mlc-ai/mlc-llm/blob/0c5f88155/python/mlc_chat/model/llama/llama_model.py#L295-L303) and expose a clean interface, free of Relax concepts, to the frontend model definition side (e.g., https://github.com/mlc-ai/mlc-llm/blob/0c5f88155/python/mlc_chat/model/llama/llama_model.py#L295-L303).
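
As a rough illustration of the principle described above, here is a minimal pure-Python sketch. It does not use TVM or the actual `tensor_ir_inplace_op` API. The function names `_embedding_lookup_inplace_kernel` and `embed_inplace` are hypothetical, and nested lists stand in for tensors. The argument names loosely mirror the unit test in this PR. The sketch only shows the two ideas involved: the kernel lives apart from the model-facing function, and the result aliases a caller-provided destination buffer instead of being freshly allocated.

```python
from typing import List

Matrix = List[List[float]]


def _embedding_lookup_inplace_kernel(
    embedding_table: Matrix, input_ids: List[int], embedding_dst: Matrix, offset: int
) -> None:
    """Hypothetical stand-in for a kernel kept out of the model definition.

    Writes the selected rows into embedding_dst starting at `offset`
    and returns nothing: the destination buffer is mutated in place.
    """
    for i, token_id in enumerate(input_ids):
        # Slice assignment copies values into the existing row object.
        embedding_dst[offset + i][:] = embedding_table[token_id]


def embed_inplace(
    embedding_table: Matrix, input_ids: List[int], embedding_dst: Matrix, offset: int
) -> Matrix:
    """Hypothetical model-facing wrapper: the returned value is embedding_dst itself."""
    _embedding_lookup_inplace_kernel(embedding_table, input_ids, embedding_dst, offset)
    return embedding_dst


table = [[0.0, 0.1], [1.0, 1.1], [2.0, 2.1]]
dst = [[-1.0, -1.0] for _ in range(4)]
out = embed_inplace(table, [2, 0], dst, offset=1)
```

Here `out` is the same object as `dst`, which is the property an in-place op is meant to provide. Rows outside the written range keep their previous contents.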

csullivan · 2024-03-07T19:59:04Z

tests/python/relax/test_frontend_nn_op.py

+        def test(
+            self, embedding_table: Tensor, input_ids: Tensor, embedding_dst: Tensor, offset: int
+        ):
+            tensor_expr_op_out = op.tensor_ir_op(


Did you mean to call op.tensor_ir_inplace_op here? It doesn't look like you are testing the new nn.op.tensor_ir_inplace_op added in this PR.

MasterJH5574 · 2024-03-11

Ooooooops, sorry, my bad. Has that been updated? If not, I can find a chance to update the test next time.

csullivan · 2024-03-11

No worries. I was hoping to use this PR to guide my use of `nn.op.tensor_ir_inplace_op`, but became nervous about using the feature when I saw it wasn't tested. I haven't made any change to update the test and would appreciate the update whenever you have cycles to come back to this. It's not blocking me though, so low priority is fine. Thanks @MasterJH5574

MasterJH5574 · 2024-03-11

Thank you so much for letting me know!

[Relax][Frontend] "tensor_ir_inplace" op

fa53e66

This PR introduces the `tensor_ir_inplace_op` for frontend so that we can leverage our `call_tir_inplace` in SLM model definition flow. One unit test is added. This PR also fixed a few typos in type annotations.

MasterJH5574 mentioned this pull request Jan 31, 2024

[Serving] In-place embedding lookup mlc-ai/mlc-llm#1691

Closed

jinhongyii approved these changes Feb 2, 2024

jinhongyii merged commit 5c68932 into apache:main Feb 2, 2024
19 checks passed

csullivan reviewed Mar 7, 2024

ysh329 mentioned this pull request Apr 21, 2024

[Release] v0.16.0 Release Candidate Notes #16911

Closed
