llama.cpp
a15ca746
- speculative : print encoding speed
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
speculative : print encoding speed
References
#2926 - speculative : PoC for speeding-up inference via speculative sampling
Author
ggerganov
Committer
ggerganov
Parents
c82c808d
Loading