vllm
5c538c37 - [V1][Bugfix][Spec Decode] Fix incorrect outputs in V1 speculative decoding due to batch indexing (#14645)

Commit
272 days ago
Signed-off-by: Benjamin Chislett <benjamin.chislett@centml.ai>