vllm
Support W8A8 INT8 MoE for compressed-tensors
#16745
Merged

Support W8A8 INT8 MoE for compressed-tensors #16745

mgoin
mgoin Support W8A8 INT8 MoE for compressed-tensors
78ae42d3
mgoin mgoin requested a review from robertgshaw2-redhat robertgshaw2-redhat 240 days ago
mgoin mgoin requested a review from tlrmchlsmth tlrmchlsmth 240 days ago
github-actions
mgoin Merge branch 'main' into support-int8-w8a8-moe
39c477fa
mgoin Update message
8696b6d3
mgoin mgoin added quantization
mgoin mgoin added moe
mgoin mgoin added ready
bnellnm
bnellnm approved these changes on 2025-04-29
ElizaWszola
ElizaWszola commented on 2025-04-30
tlrmchlsmth
tlrmchlsmth approved these changes on 2025-05-02
tlrmchlsmth tlrmchlsmth merged 868c546d into main 224 days ago
mgoin mgoin deleted the support-int8-w8a8-moe branch 224 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone