vllm
28ef9ba3
- [BugFix] Add support for MTP num_speculative_tokens > 1 with sparse MLA (#34552)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 days ago
[BugFix] Add support for MTP num_speculative_tokens > 1 with sparse MLA (#34552) Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by: Matthew Bonanni <mbonanni@redhat.com> Co-authored-by: Matthew Bonanni <mbonanni@redhat.com>
References
#34552 - [BugFix] Add support for MTP num_speculative_tokens > 1 with sparse MLA
Author
LucasWilkinson
Parents
fb7fdc49
Loading