llama.cpp
f0995d28 - metal : use FA-vec kernel up to batch size 20 (#13496)

Commit
177 days ago
metal : use FA-vec kernel up to batch size 20 (#13496) * batched-bench : fix pp batch contents * metal : optimize multi-sequence FA vec kernel ggml-ci * metal : use FA-vec kernel up to batch size 20 ggml-ci
Author
Parents
Loading