CUTLASS 1.2

@kerrmudgeon kerrmudgeon released this 26 Oct 22:02
ed2ed4d

CUTLASS 1.2.0
(2018-10-26)

  • Parallelized reductions across threadblocks ("Split-K")
  • Improved IGEMM performance
  • Batched strided WMMA GEMMs
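
For readers unfamiliar with the term, the "Split-K" idea in the first item can be sketched on the CPU as follows. This is a minimal illustration, not the CUTLASS API: the function name `split_k_gemm`, the row-major layout, and the workspace arrangement are all assumptions made for this example. The K dimension is partitioned into slices, each slice produces a partial M x N product (on a GPU, different threadblocks would handle different slices), and a final reduction sums the partials.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of CUTLASS): computes C = A * B for
// row-major A (M x K) and B (K x N), splitting K into `splits` slices.
std::vector<float> split_k_gemm(const std::vector<float>& A,
                                const std::vector<float>& B,
                                int M, int N, int K, int splits) {
    assert(splits > 0);
    assert(A.size() == static_cast<std::size_t>(M) * K);
    assert(B.size() == static_cast<std::size_t>(K) * N);

    const std::size_t mn = static_cast<std::size_t>(M) * N;
    const int slice = (K + splits - 1) / splits;  // ceil(K / splits)

    // Workspace holds one partial M x N result per K slice.
    std::vector<float> workspace(static_cast<std::size_t>(splits) * mn, 0.0f);

    // Phase 1: each slice accumulates only over its own range of k.
    // On a GPU these slices would run in parallel; here they run in sequence.
    for (int s = 0; s < splits; ++s) {
        const int k_begin = std::min(K, s * slice);
        const int k_end = std::min(K, k_begin + slice);
        float* partial = workspace.data() + static_cast<std::size_t>(s) * mn;
        for (int m = 0; m < M; ++m) {
            for (int n = 0; n < N; ++n) {
                float acc = 0.0f;
                for (int k = k_begin; k < k_end; ++k) {
                    acc += A[static_cast<std::size_t>(m) * K + k] *
                           B[static_cast<std::size_t>(k) * N + n];
                }
                partial[static_cast<std::size_t>(m) * N + n] = acc;
            }
        }
    }

    // Phase 2: reduce the partial results across slices into C.
    std::vector<float> C(mn, 0.0f);
    for (int s = 0; s < splits; ++s) {
        for (std::size_t i = 0; i < mn; ++i) {
            C[i] += workspace[static_cast<std::size_t>(s) * mn + i];
        }
    }
    return C;
}
```

Splitting along K tends to help when M and N are small relative to K, since a conventional tiling over the output would otherwise launch too few threadblocks to occupy the GPU. The cost is the extra workspace and the reduction pass.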