llama.cpp
af99c6fb
- llama : remove memory_f16 and kv_f16 flags
Commit
2 years ago
llama : remove memory_f16 and kv_f16 flags
References
gg/quantum-k-cache
#4312 - llama : support quantum K cache
Author
ggerganov
Parents
4adb1d69