vllm
Revert "[Llama4,Quantization] Simplify and generalize logic for Q/K permutations in quantized self-attn layers "
#34997
Merged


LucasWilkinson: Revert "[Llama4,Quantization] Simplify and generalize logic for Q/K p…" (commit 365da4f8)
mergify added the llama label
gemini-code-assist commented on 2026-02-20
LucasWilkinson added the ready label
LucasWilkinson requested a review from DarkLight1337 44 days ago
LucasWilkinson requested a review from robertgshaw2-redhat 44 days ago
mgoin approved these changes on 2026-02-20
mgoin added the ci-failure label
mgoin enabled auto-merge (squash) 44 days ago
vllm-bot merged 0e22cd61 into main 44 days ago
vllm-bot deleted the revert-34471-fix-llama4-quant-loading branch 44 days ago
