llama.cpp
d5d7628b - refactor: Remove n_embd_k/v_gqa from recurrent cache

Commit
114 days ago
refactor: Remove n_embd_k/v_gqa from recurrent cache This is no longer needed now that there are separate implementations https://github.com/ggml-org/llama.cpp/pull/13979#discussion_r2140825128 Branch: HybridRecurrentCache Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>
Author
Committer
Parents
Loading