vllm
11dfdf21 - [Kernel] DeepGemm MoE : Integrate triton permute / unpermute kernels (#20903)

Commit
159 days ago
[Kernel] DeepGemm MoE : Integrate triton permute / unpermute kernels (#20903) Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Parents
Loading