Skip to content

CUDA: add FP32 FlashAttention vector kernel#7188

Merged
JohannesGaessler merged 4 commits intoggml-org:masterfrom JohannesGaessler:cuda-fa-no-tc-11May 12, 2024