vllm
57c629e9 - [Bugfix] Fix block_size for hybrid model MTP (#36036)

Commit
55 days ago
[Bugfix] Fix block_size for hybrid model MTP (#36036) Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
Author
Parents
Loading