vllm
de509ae8 - [NVIDIA] Explicitly disable shuffled weights for flashinfer blockscale moe fp8 kernels (#21411)

Commit
282 days ago
[NVIDIA] Explicitly disable shuffled weights for flashinfer blockscale moe fp8 kernels (#21411)

Signed-off-by: kaixih <kaixih@nvidia.com>