llama.cpp
1dd12133 - refactor: Remove n_embd_k/v_s from unified cache

Commit
114 days ago
refactor: Remove n_embd_k/v_s from unified cache

No longer needed now that unified isn't also supporting recurrent

https://github.com/ggml-org/llama.cpp/pull/13979#discussion_r2140761069

Branch: HybridRecurrentCache