llama.cpp
7e13f19f
- llama : rethink recurrent state cell counts
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
llama : rethink recurrent state cell counts * llama : begin work on support for variable GQA This will also be useful for Jamba if we consider the Mamba layers to have 0 KV heads. * llama : gracefully fail when not finding hybrid slot
References
#7531 - llama : support Jamba hybrid Transformer-Mamba models
Author
compilade
Parents
3b57b55c
Loading