vllm
332d4cb1
- [Feature][Quantization] MXFP4 support for MOE models (#17888)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
157 days ago
[Feature][Quantization] MXFP4 support for MOE models (#17888) Signed-off-by: Felix Marty <felmarty@amd.com> Signed-off-by: Bowen Bao <bowenbao@amd.com> Signed-off-by: Felix Marty <Felix.Marty@amd.com> Co-authored-by: Bowen Bao <bowenbao@amd.com>
References
#17888 - [Feature][Quantization] MXFP4 support for MOE models
Author
fxmarty-amd
Parents
bf03ff35
Loading