llama.cpp
d9c6ce46
- kv-cache : support V-less cache (#19067)
Commit
117 days ago
kv-cache : support V-less cache (#19067)

* kv-cache : support V-less cache
* cuda : better check for V_is_K_view
* cuda : improve V_is_K_view check
* graph : add comments
* hparams : refactor
References
#19067 - kv-cache : support V-less cache
Author
ggerganov
Parents
70d86082