vllm
0b99f5d3
- support flashinfer_fp4 moe for 5090 gpu (#26669)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
195 days ago
support flashinfer_fp4 moe for 5090 gpu (#26669) Signed-off-by: XiaobingSuper <xiaobingzhangupc@gmail.com> Signed-off-by: Michael Goin <mgoin64@gmail.com> Co-authored-by: Michael Goin <mgoin64@gmail.com>
Author
XiaobingSuper
Parents
1f491aa0
Loading