vllm
b4c8fbaa
- Add TRTLLM MoE NVFP4 kernel to CompressedTensorsW4A4MoeMethod (#28892)
Commit
70 days ago
Add TRTLLM MoE NVFP4 kernel to CompressedTensorsW4A4MoeMethod (#28892)

Signed-off-by: mingyuanm <mingyuanm@nvidia.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
References
#28892 - Add TRTLLM MoE NVFP4 kernel to CompressedTensorsW4A4MoeMethod
Author
Victor49152
Parents
e99e4673