llama.cpp
c1596f63 - llama : fix kv cache heuristic when context is less than 32

Commit
1 year ago
llama : fix kv cache heuristic when context is less than 32
Author
Parents
Loading