vllm
116ed130
- [Bugfix] Fix GDN attention crash with mixed decode/spec-decode batches (#34871)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
43 days ago
[Bugfix] Fix GDN attention crash with mixed decode/spec-decode batches (#34871) Signed-off-by: haosdent <haosdent@gmail.com>
References
#34871 - [Bugfix] Fix GDN attention crash with mixed decode/spec-decode batches
Author
haosdent
Parents
8374387b
Loading