onnxruntime
68b9d9bf
- [CUDA] BF16 MoE and qMoE (#25572)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
194 days ago
[CUDA] BF16 MoE and qMoE (#25572) Add support of bfloat16 in MoE and qMoE cuda ops.
References
#25572 - [CUDA] BF16 MoE and qMoE
Author
tianleiwu
Parents
866c7e3a
Loading