llama.cpp f0995d28 - metal : use FA-vec kernel up to batch size 20 (#13496)
Commit
177 days ago
metal : use FA-vec kernel up to batch size 20 (#13496)

* batched-bench : fix pp batch contents

* metal : optimize multi-sequence FA vec kernel

ggml-ci

* metal : use FA-vec kernel up to batch size 20

ggml-ci
References
#13496 - metal : use FA-vec kernel up to batch size 20
Author
ggerganov
Parents
c252e0c4