vllm
c7d8724e
- [Core] FlashInfer CUTLASS fused MoE backend (NVFP4) (#20037)
Commit
283 days ago
[Core] FlashInfer CUTLASS fused MoE backend (NVFP4) (#20037)

Signed-off-by: shuw <shuw@nvidia.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
References
#20037 - [Core] FlashInfer CUTLASS fused MoE backend (NVFP4)
Author
wenscarl
Parents
b38baabc