llama.cpp
d5d7628b - refactor: Remove n_embd_k/v_gqa from recurrent cache

Commit

114 days ago

refactor: Remove n_embd_k/v_gqa from recurrent cache

This is no longer needed now that there are separate implementations

https://github.com/ggml-org/llama.cpp/pull/13979#discussion_r2140825128

Branch: HybridRecurrentCache
Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

References

#13979 - Hybrid recurrent cache

Author

gabe-l-hart

Committer

gabe-l-hart

Parents

b42c8b43
