llama.cpp
e216aa04 - llama : only copy used KV cache in get / set state (#1272)

Commit
2 years ago
llama : only copy used KV cache in get / set state (#1272) * llama : only copy used KV cache in get / set state * switch to ggml for copying k, v * avoid designated initializers
Author
Parents
Loading