vllm
d007387a - [Bugfix] Cache added_vocab to avoid per-token overhead (#30351)

Commit
141 days ago
[Bugfix] Cache added_vocab to avoid per-token overhead (#30351) Signed-off-by: limingliang <limingliang@stepfun.com> Co-authored-by: limingliang <limingliang@stepfun.com>
Author
Parents
Loading