Revert "[Llama4,Quantization] Simplify and generalize logic for Q/K permutations in quantized self-attn layers" #34997
Revert "[Llama4,Quantization] Simplify and generalize logic for Q/K p…
365da4f8
mgoin approved these changes on 2026-02-20
mgoin enabled auto-merge (squash) 44 days ago
vllm-bot merged 0e22cd61 into main 44 days ago
vllm-bot deleted the revert-34471-fix-llama4-quant-loading branch 44 days ago
Assignees: No one assigned
Labels: ready, ci-failure, llama