vllm
[Bugfix] Fix block_size for hybrid model MTP
#36036
Merged

[Bugfix] Fix block_size for hybrid model MTP #36036

benchislett
benchislett Use the proper block size for drafting
23041c4a
benchislett benchislett requested a review from LucasWilkinson LucasWilkinson 71 days ago
benchislett benchislett requested a review from luccafong luccafong 71 days ago
benchislett benchislett requested a review from MatthewBonanni MatthewBonanni 71 days ago
mergify mergify added speculative-decoding
mergify mergify added v1
mergify mergify added bug
gemini-code-assist
gemini-code-assist commented on 2026-03-04
LucasWilkinson
LucasWilkinson approved these changes on 2026-03-04
benchislett benchislett added ready
benchislett benchislett enabled auto-merge (squash) 70 days ago
benchislett set block size in test
24c3aa28
mergify
benchislett Merge branch 'main' into eagle-fix-block-size
92d123ff
benchislett benchislett merged 57c629e9 into main 70 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone