vllm
e4ee6586 - [Model] add optimal triton fused moe configs for NemotronH MoE (#27967)

Commit
103 days ago
[Model] add optimal triton fused moe configs for NemotronH MoE (#27967) Signed-off-by: Tomer Asida <57313761+tomeras91@users.noreply.github.com>
Author
Parents
Loading