vllm
28ef9ba3 - [BugFix] Add support for MTP num_speculative_tokens > 1 with sparse MLA (#34552)

Commit
2 days ago
[BugFix] Add support for MTP num_speculative_tokens > 1 with sparse MLA (#34552) Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by: Matthew Bonanni <mbonanni@redhat.com> Co-authored-by: Matthew Bonanni <mbonanni@redhat.com>
Parents
Loading