[Marvell BYOC]: Marvell AI Accelerator Integration - Phase 1 #16570

f2013519 · 2024-02-14T07:51:35Z

Summary

This PR adds support for partitioning and compiling models for the Marvell BYOC target, along with initial tvmc integration. Support for the runtime (simulator-based and hardware) and other features (int8) will be added in phases, as described in the pre-RFC post: https://discuss.tvm.apache.org/t/prerfc-byoc-integrating-marvell-ml-ai-accelerator-to-the-tvm-byoc-framework/16155.

Please see the pre-RFC for a detailed description of the design and roadmap going forward.

Building

We have introduced a new cmake flag:
USE_MRVL=ON/OFF

This flag enables Marvell BYOC codegen and is required for using the Marvell BYOC functionality, running unit tests, etc.

Usage

The tvmc interface for Marvell BYOC will be similar to other composite targets.

The command below is an example of cross-compiling an ONNX model for an Octeon target.

python3 -m tvm.driver.tvmc compile \
    --target="mrvl, llvm" \
    --target-llvm-mtriple=aarch64-linux-gnu \
    --target-llvm-mcpu=neoverse-n2 \
    --target-mrvl-num_tiles=4 \
    --cross-compiler aarch64-linux-gnu-gcc \
    --output model.tar \
    model.onnx

Supported operators will be partitioned and compiled for the MLIP, and the remaining operators will be compiled for the ARM Neoverse cores using the default LLVM target.

Compilation through the TVM Python API is also supported; please refer to the doc added as part of this PR for details.
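
For scripted builds, the tvmc invocation shown in the Usage section can be assembled programmatically. The following is a minimal sketch and not part of this PR: the helper name `build_tvmc_command` and its parameters are hypothetical, and it only reuses the flags and default values that appear in the example command.

```python
import shlex
import subprocess


def build_tvmc_command(
    model_path,
    output_path,
    num_tiles=4,
    mtriple="aarch64-linux-gnu",
    mcpu="neoverse-n2",
    cross_compiler="aarch64-linux-gnu-gcc",
):
    """Return the example tvmc cross-compile command as an argv list.

    Hypothetical helper: it mirrors the flags from the example command
    and does not validate them against tvmc.
    """
    return [
        "python3", "-m", "tvm.driver.tvmc", "compile",
        "--target=mrvl, llvm",                 # Marvell BYOC plus LLVM fallback
        f"--target-llvm-mtriple={mtriple}",
        f"--target-llvm-mcpu={mcpu}",
        f"--target-mrvl-num_tiles={num_tiles}",
        "--cross-compiler", cross_compiler,
        "--output", output_path,
        model_path,
    ]


def compile_model(model_path, output_path, **kwargs):
    """Run the command; requires a TVM build with USE_MRVL=ON."""
    cmd = build_tvmc_command(model_path, output_path, **kwargs)
    subprocess.run(cmd, check=True)  # raises CalledProcessError on failure


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch works
    # without TVM or the cross-compiler installed.
    print(shlex.join(build_tvmc_command("model.onnx", "model.tar")))
```

Passing the arguments as a list avoids shell quoting problems with the space inside the `mrvl, llvm` target string.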

f2013519 · 2024-02-15T19:49:45Z

cc @Hzfengsy @vinx13

Hzfengsy

LGTM. BTW, as we are moving to TVM Unity / Relax, would be great if you could consider supporting MRVL for unity flow for emerging needs (e.g. LLMs)

f2013519 · 2024-02-16T06:18:41Z

Thanks @Hzfengsy. We are closely following TVM Unity / Relax development and it is part of our future roadmap. We will work on it once we complete the initial Relay flow support.

f2013519 force-pushed the main branch from 86ca179 to 8b06e03 on February 15, 2024 06:08

[Marvell BYOC]: Marvell AI Accelerator Integration - Phase 1

af0b02e

f2013519 force-pushed the main branch from 8b06e03 to af0b02e on February 15, 2024 06:08

Hzfengsy approved these changes Feb 16, 2024

Hzfengsy merged commit 5645c52 into apache:main Feb 16, 2024
19 checks passed

ysh329 mentioned this pull request Apr 21, 2024

[Release] v0.16.0 Release Candidate Notes #16911

Closed

f2013519 mentioned this pull request Apr 22, 2024

[Marvell BYOC]: Marvell AI Accelerator Integration - Phase 2 #16915

Merged
