Use BLAS to implement ggml_compute_forward_out_prod_f32 for matrix src0, src1 (finetuning speedup ~5x). #4079

Merged
ggerganov merged 4 commits into ggerganov:master from gwjr:out-prod-using-blas on Nov 17, 2023