llama.cpp
1fb2658b
- server: introduce self-speculative decoding
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
10 days ago
server: introduce self-speculative decoding
References
#18471 - Add self‑speculative decoding (no draft model required)
Author
srogmann
Committer
srogmann
Parents
8f91ca54
Loading