vllm
5a787d3b
- fix flashinfer
Commit
7 days ago
fix flashinfer

Signed-off-by: yewentao256 <zhyanwentao@126.com>
References
#29125 - [Feature] Batch invariant: Enable `TRITON_MLA` without prefix-caching
Author
yewentao256
Parents
2e1d0f86