vllm
57c629e9
- [Bugfix] Fix block_size for hybrid model MTP (#36036)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
55 days ago
[Bugfix] Fix block_size for hybrid model MTP (#36036) Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
References
#36036 - [Bugfix] Fix block_size for hybrid model MTP
Author
benchislett
Parents
d106bf39
Loading