vllm
de92d916 - [NVIDIA] Add support for cudnn fp4 gemm via flashinfer (#26107)

Commit
161 days ago
[NVIDIA] Add support for cudnn fp4 gemm via flashinfer (#26107) Signed-off-by: kaixih <kaixih@nvidia.com> Signed-off-by: mgoin <mgoin64@gmail.com> Co-authored-by: mgoin <mgoin64@gmail.com>
Author
Parents
Loading