vllm
Revert "[Llama4,Quantization] Simplify and generalize logic for Q/K permutations in quantized self-attn layers "
#34997
Merged