vllm
27b78c73 - [Kernel] add triton fused moe kernel for gptq/awq (#12185)

Comment changes are shownComment changes are hidden
Commit
166 days ago
[Kernel] add triton fused moe kernel for gptq/awq (#12185)
Author
Parents
  • tests/kernels
    • File
      test_moe.py
  • vllm/model_executor/layers
    • fused_moe
      • File
        fused_moe.py
    • quantization
      • File
        __init__.py
      • File
        moe_wna16.py
Loading