llama.cpp
36375762
- server : disable speculative decoding for SWA models (#13970)
Commit
197 days ago
server : disable speculative decoding for SWA models (#13970)

* server : use swa-full for draft context

ggml-ci

* server : disable speculative decoding for SWA models
References
#13970 - server : disable speculative decoding for SWA models
Author
ggerganov
Parents
ea394d7a