llama : KV cache view API + better KV cache management #4170
llama : keep track of used KV cells + better KV cache management
79cb8f00
llama : zero KV cache used upon clear
671f639c
llama : allow exporting a view of the KV cache (#4180)
5df7d06c
ggerganov changed the title from "llama : keep track of used KV cells + better KV cache management" to "llama : KV cache view API + better KV cache management" 1 year ago
common : add -dkvc arg for enabling kv cache dumps
f8e9f114
ggerganov merged commit 6b0a7420 into master 1 year ago