llama.cpp
36375762 - server : disable speculative decoding for SWA models (#13970)

Commit
197 days ago
server : disable speculative decoding for SWA models (#13970)

* server : use swa-full for draft context

ggml-ci

* server : disable speculative decoding for SWA models