vllm
[Bugfix] Support MLA for CompressedTensorsWNA16
#13725
Merged

[Bugfix] Support MLA for CompressedTensorsWNA16 #13725

mgoin
mgoin Fix MLA for CompressedTensorsWNA16
0282207b
github-actions
mgoin Merge branch 'main' into fix-mla-w4-ct
6be8aea7
mgoin Fix format
84a84084
mgoin mgoin requested a review from LucasWilkinson LucasWilkinson 298 days ago
mgoin mgoin requested a review from tlrmchlsmth tlrmchlsmth 298 days ago
mgoin mgoin requested a review from robertgshaw2-redhat robertgshaw2-redhat 298 days ago
mgoin mgoin added bug
mgoin mgoin added quantization
mgoin mgoin changed the title Fix MLA for CompressedTensorsWNA16 [Bugfix] Support MLA for CompressedTensorsWNA16 298 days ago
tlrmchlsmth
tlrmchlsmth approved these changes on 2025-02-24
tlrmchlsmth tlrmchlsmth added ready
tlrmchlsmth tlrmchlsmth enabled auto-merge (squash) 298 days ago
kylesayrs
kylesayrs approved these changes on 2025-02-24
tlrmchlsmth tlrmchlsmth merged 18e50593 into main 298 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone