Skip to content

Apache TVM v0.19.0

Pre-release
Pre-release
Compare
Choose a tag to compare
@ysh329 ysh329 released this 24 Jan 02:05
· 11 commits to main since this release

Introduction

The TVM community has worked since the last release to deliver the following new exciting improvements!

The main tags are below (bold text is with lots of progress): Relax, OpenCL, MetaSchedule.

Please visit the full listing of commits for a complete view: v0.19.dev0...v0.19.0.rc0.

Community

None.

RFCs

None.

Arith

  • #17469 - [LLVM]Presburger compile fix for MLIR/LLVM 19.x

BugFix

  • #17595 - [Fix][KVCache] Fix incorrect tile size calculation
  • #17549 - [FIX][LLVM] Workaround -mcpu=apple-latest for llvm above 18.0 (#17492)
  • #17537 - [FIX][topi.scatter_nd] fixed shape equality assert by using analyzer to prove equality
  • #17502 - [FIX][TOPI][strided_slice] Fix topi.strided_slice output shape
  • #17505 - [RELAX][ONNX][FIX] add a parser to handle expression in the shape dim names
  • #17490 - [FIX][ONNX][RELAX] Add support for dynamic ShapeExpr in Slice, Squeeze and Flatten
  • #17467 - [FIX][RELAX][ONNX] Fix typo in onnx frontend

CI

  • #17596 - [Test] Skip flaky test to unblock CI
  • #17451 - Upgrade CI image to 20241105-030952-3e386fd3
  • #17534 - Upgrade zephyr-sdk to 0.16.9
  • #17503 - Upgrade oneflow==0.9.0
  • #17485 - Revert jax, keras, tensorflow, and tflite upgrades introduced #17425
  • #17470 - Pin cpplint==1.6.1

Docs

  • #17518 - Few fixes for broken Adreno docs
  • #17527 - Fix typo in TensorIR
  • #17528 - Fix Typo in Debugging TVM

LLVM

  • #17547 - Make compilable with LLVM-20
  • #17538 - [RUNTIME] Make ORCJIT LLVM executor the default one

MetaSchedule

  • #17465 - Fix a multilevel tiling error on dynamic relax workload

OpenCL & CLML

  • #17516 - [RUNTIME][CLML] Dynamic backward compatibility
  • #17519 - [OPENCL][ADRENO] Introduce Qualcomm extension support
  • #17517 - [TEST][CLML] Clip test case updated
  • #17472 - [Device][OpenCL] add CL_EXEC_STATUS_ERROR_FOR_EVENTS_IN_WAIT_LIST to …

Relax

  • #17541 - Fix bug in convert_layout pass
  • #17539 - [KVCache] Fix attention prefill kernel for Metal and Android
  • #17540 - Add support for ONNX LPPool
  • #17536 - [Frontend][Onnx] Add auto_pad support for conv
  • #17525 - support masked_scatter
  • #17506 - [Python]Update Rotary positional embedding scaling
  • #17523 - Add gather_elements and gather_nd operators
  • #17511 - Update ONNX frontend for unique, nonzero and compress
  • #17509 - support scatter ops
  • #17504 - [ONNX] Add support for dynamic shape expression in Expand
  • #17482 - [KVCACHE] Improved schedule for prefill attention
  • #17445 - [MetaSchedule] Support CPU weight prepack
  • #17462 - Enhance Relax op and ONNX frontend
  • #17466 - Revert "[KVCACHE] Improved schedule for prefill attention"

Runtime

  • #17557 - [Dist] Implementation of KV cache transfer
  • #17498 - [mrvl]: Support Marvell Hardware Runtime

TIR

  • #17423 - [Schedule] Add annotate_buffer_access primitive

web

  • #17545 - Allows setting powerPreference on webgpu

Misc

  • #17593 - Fix GPU detection in PerStoreFeatureNode
  • #17554 - [Refactor] Phase out microTVM
  • #17542 - [REFACTOR] Phase out VTA
  • #17533 - [Contrib] Remove CLML version print
  • #17532 - [3rdparty] Update Picojson with const operator[] function (#327)
  • #17474 - [TE][CreatePrimFunc] Fix loop carried dependency case with nested block levels
  • #17501 - Fix InternalError in StaticPlanBlockMemory when visiting DataflowBlockNode
  • #17455 - Compiled with Default Target(LLVM) and Built with USE_MRVL=ON
  • #17481 - [Marvell BYOC]: global_max_pool2d and squeeze op support
  • #17484 - Replace np.int with np.int32
  • #17476 - Pin pytest-profiling==1.7.0
  • #17464 - [JVM] Align Java GraphModule Initialization with Python API
  • #17458 - Show the record if the escape sequence is unsupported