llama.cpp
7e13f19f - llama : rethink recurrent state cell counts

Commit
1 year ago
llama : rethink recurrent state cell counts

* llama : begin work on support for variable GQA

  This will also be useful for Jamba if we consider the Mamba layers to have 0 KV heads.

* llama : gracefully fail when not finding hybrid slot
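To illustrate the idea of variable GQA mentioned in the commit message, here is a minimal sketch in which the number of KV heads is stored per layer and a layer with 0 KV heads is treated as recurrent (Mamba-style). All names here (`LayerHParams`, `is_recurrent_layer`, `kv_elems_per_token`, `total_kv_elems_per_token`) are hypothetical and chosen for illustration; they are not taken from llama.cpp, and the actual implementation in this commit may be organized differently.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical per-layer hyperparameters (illustrative names, not llama.cpp's).
struct LayerHParams {
    uint32_t n_head;    // number of query heads in this layer
    uint32_t n_head_kv; // number of KV heads; 0 marks a recurrent layer in this sketch
};

// Assumption of this sketch: a layer with zero KV heads has no attention KV cache.
inline bool is_recurrent_layer(const LayerHParams & layer) {
    return layer.n_head_kv == 0;
}

// Number of K (or, equally, V) elements cached per token for one layer.
inline uint64_t kv_elems_per_token(const LayerHParams & layer, uint32_t head_dim) {
    return static_cast<uint64_t>(layer.n_head_kv) * head_dim;
}

// Total K plus V elements cached per token across all layers.
// Recurrent layers contribute nothing because their KV head count is 0.
inline uint64_t total_kv_elems_per_token(const std::vector<LayerHParams> & layers,
                                         uint32_t head_dim) {
    uint64_t total = 0;
    for (const auto & layer : layers) {
        total += 2 * kv_elems_per_token(layer, head_dim);
    }
    return total;
}
```

With per-layer counts, a hybrid model needs no special case when sizing the attention cache: the recurrent layers simply add zero to the total.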