vllm
[Bugfix] Cache added_vocab to avoid per-token overhead
#30351
Merged

Commits
  • fix: cache added_vocab to avoid per-token overhead
    limingliang committed 170 days ago
  • return a shallow copy of self._added_vocab
    limingliang committed 170 days ago
Loading