Skip to content

[ascend] optimize tp > 1 latency by using graph operation and torch_npu op command launcher#143

Draft
CyCle1024 wants to merge 1 commit intoDeepLink-org:mainfrom CyCle1024:ccy/tp_opt

Commits

Commits on Dec 27, 2024