vllm
de92d916
- [NVIDIA] Add support for cudnn fp4 gemm via flashinfer (#26107)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
161 days ago
[NVIDIA] Add support for cudnn fp4 gemm via flashinfer (#26107) Signed-off-by: kaixih <kaixih@nvidia.com> Signed-off-by: mgoin <mgoin64@gmail.com> Co-authored-by: mgoin <mgoin64@gmail.com>
References
#26107 - [NVIDIA] Add support for cudnn fp4 gemm via flashinfer
Author
kaixih
Parents
a1063628
Loading