vllm
e4ee6586
- [Model] add optimal triton fused moe configs for NemotronH MoE (#27967)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
103 days ago
[Model] add optimal triton fused moe configs for NemotronH MoE (#27967) Signed-off-by: Tomer Asida <57313761+tomeras91@users.noreply.github.com>
References
#27967 - [Model] add optimal triton fused moe configs for NemotronH MoE
Author
tomeras91
Parents
77f8001f
Loading