llama.cpp
f2f08f84
- server: improve speed of speculative decoding
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
154 days ago
server: improve speed of speculative decoding
References
#17808 - server: improve speed of speculative decoding
#51 - (FOR CI) Xsn/server improve spec
Author
ngxson
Parents
933414c0
Loading