vllm
332d4cb1 - [Feature][Quantization] MXFP4 support for MOE models (#17888)

Commit
157 days ago
[Feature][Quantization] MXFP4 support for MOE models (#17888)

Signed-off-by: Felix Marty <felmarty@amd.com>
Signed-off-by: Bowen Bao <bowenbao@amd.com>
Signed-off-by: Felix Marty <Felix.Marty@amd.com>
Co-authored-by: Bowen Bao <bowenbao@amd.com>