vllm
de509ae8
- [NVIDIA] Explicitly disable shuffled weights for flashinfer blockscale moe fp8 kernels (#21411)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
282 days ago
[NVIDIA] Explicitly disable shuffled weights for flashinfer blockscale moe fp8 kernels (#21411) Signed-off-by: kaixih <kaixih@nvidia.com>
References
#21411 - [NVIDIA] Explicitly disable shuffled weights for flashinfer blockscale moe fp8 kernels
Author
kaixih
Parents
e7c4f9ee
Loading