[Model] add optimal triton fused moe configs for NemotronH MoE #27967
Add NemotronHForCausalLM arch to benchmark_moe.py
ffa56141
Add triton moe configs for nemotronH for TP=1,2 / H100 / L40S (BF16)
17472d3a
heheda12345
changed the title [Model] app optimal triton fused moe configs for NemotronH MoE [Model] add optimal triton fused moe configs for NemotronH MoE 201 days ago
Merge branch 'main' into add-nemotronH-moe-configs
7aaa7af9
tomeras91
deleted the add-nemotronH-moe-configs branch 201 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub