vllm
507df79a
- [Hybrid] Simplify accepted token counting in spec decode for hybrid models (#38372)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
17 days ago
[Hybrid] Simplify accepted token counting in spec decode for hybrid models (#38372)
References
#38372 - [Hybrid] Simplify accepted token counting in spec decode for hybrid models
Author
fuscof-ibm
Parents
1696c864
Loading