text-generation-inference
1c84a30f - MoE Marlin: support `desc_act` for `groupsize != -1` (#2590)

Commit
1 year ago
This change uses the updated Marlin MoE kernel from vLLM to support MoE with activation sorting (`desc_act`) and groups (`groupsize != -1`).
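For context, here is a minimal sketch of why activation sorting interacts with grouping. It is an illustration under stated assumptions, not the kernel code from this commit. In GPTQ-style checkpoints with `desc_act` enabled, a `g_idx` array maps each input row to its quantization group, and that mapping is generally not monotonic. One common way to handle it is to sort `g_idx` and keep the permutation so that rows of the same group become contiguous. The helper name `sort_g_idx` and the example values are hypothetical.

```python
# Illustrative sketch only: plain Python, no GPU kernel involved.

def sort_g_idx(g_idx):
    """Return (sorted g_idx, permutation) so rows of one group are contiguous."""
    # Stable argsort: ties keep their original relative order.
    perm = sorted(range(len(g_idx)), key=lambda i: g_idx[i])
    return [g_idx[i] for i in perm], perm

# Hypothetical example: 8 input rows, groupsize = 4, hence 2 groups.
# With desc_act, rows are assigned to groups in activation order,
# so consecutive rows may belong to different groups.
g_idx = [1, 0, 0, 1, 1, 0, 1, 0]

sorted_g_idx, perm = sort_g_idx(g_idx)
print(sorted_g_idx)  # [0, 0, 0, 0, 1, 1, 1, 1]
print(perm)          # [1, 2, 5, 7, 0, 3, 4, 6]
```

Applying `perm` to the activation columns (and the matching weight rows) lets each group's scales be applied to a contiguous block. With `groupsize == -1` there is a single group per column, so no such permutation is needed.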