vllm
116ed130 - [Bugfix] Fix GDN attention crash with mixed decode/spec-decode batches (#34871)

Commit
43 days ago
[Bugfix] Fix GDN attention crash with mixed decode/spec-decode batches (#34871)

Signed-off-by: haosdent <haosdent@gmail.com>