llama.cpp
59db9a35
- llama: dynamic head_dim and n_rot for SWA (#20301)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
35 days ago
llama: dynamic head_dim and n_rot for SWA (#20301) * llama: dynamic head_dim and n_rot for SWA * also add gguf_writer wrappers * fix build * build_rope_shift arg reorder
References
#20301 - llama: dynamic head_dim and n_rot for SWA
Author
ngxson
Parents
23fbfcb1
Loading