vllm
Fix CompressedTensorsWNA16MoE with grouped scales
#13769
Merged

Fix CompressedTensorsWNA16MoE with grouped scales #13769

mgoin
mgoin Fix CompressedTensorsWNA16MoE with grouped scales
a1e19552
mgoin mgoin requested a review from robertgshaw2-redhat robertgshaw2-redhat 303 days ago
mgoin mgoin requested a review from tlrmchlsmth tlrmchlsmth 303 days ago
github-actions
mgoin Merge branch 'main' into fix-ct-marlin-moe
61380085
mgoin
dsikka
dsikka commented on 2025-02-24
dsikka
dsikka approved these changes on 2025-02-24
mgoin mgoin added quantization
mgoin mgoin added ready
jeejeelee
jeejeelee approved these changes on 2025-02-25
simon-mo simon-mo merged 4d251ad0 into main 303 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone