onnxruntime
68b9d9bf - [CUDA] BF16 MoE and qMoE (#25572)

Commit
194 days ago
[CUDA] BF16 MoE and qMoE (#25572) Add support of bfloat16 in MoE and qMoE cuda ops.
Author
Parents
Loading