llama.cpp
833dfb54
- fix: Use per-layer n_embd_k/v_s calls for mamba (1) layers
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
114 days ago
fix: Use per-layer n_embd_k/v_s calls for mamba (1) layers Branch: HybridRecurrentCache Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>
References
#13979 - Hybrid recurrent cache
Author
gabe-l-hart
Committer
gabe-l-hart
Parents
f6d5f055
Loading