vllm
7c12a765
- [Misc] Simplify the prefix caching logic on draft tokens (#20701)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
157 days ago
[Misc] Simplify the prefix caching logic on draft tokens (#20701) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
References
#20701 - [Misc] Simplify the prefix caching logic on draft tokens
Author
WoosukKwon
Parents
cd587c93
Loading