onnxruntime
9206b7cd - Zhijxu/cast propagation softmax (#16408)

Commit
2 years ago
Zhijxu/cast propagation softmax (#16408) enhance cast-propagation for "softmax can be put at fp16 when data flow is cast-to-fp32 > softmax > cast-to-fp16" this optimization can save gpu memory and have performance gain
Author
Parents
Loading