Skip to content

CUDA: mul_mat_vec_q tiling, refactor mul mat logic#5434

Merged
JohannesGaessler merged 7 commits intoggerganov:masterfrom JohannesGaessler:cuda-faster-mmvq-12Feb 11, 2024

Commits

Commits on Feb 9, 2024

Commits on Feb 10, 2024

Commits on Feb 11, 2024