vllm
d007387a
- [Bugfix] Cache added_vocab to avoid per-token overhead (#30351)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
141 days ago
[Bugfix] Cache added_vocab to avoid per-token overhead (#30351) Signed-off-by: limingliang <limingliang@stepfun.com> Co-authored-by: limingliang <limingliang@stepfun.com>
References
#30351 - [Bugfix] Cache added_vocab to avoid per-token overhead
Author
scratch-ml
Parents
3bdd4266
Loading