onnxruntime
276918d9 - Allow SkipLayerNorm and LayerNorm in rotary attention fusion (#18288)

2 years ago
Although SimplifiedLayerNorm is generally faster than LayerNorm, DML (DirectML) does not yet have an optimized implementation of SimplifiedLayerNorm, so LayerNorm ends up being faster on DML.
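As a rough illustration of what the commit title describes, the sketch below shows a fusion check that accepts a set of normalization op types rather than only the simplified variants. It is not the code from this commit: `Node`, `ACCEPTED_NORM_OPS`, and `is_norm_anchor` are hypothetical names used for illustration, and only the op type strings correspond to real ONNX Runtime operators.

```python
from dataclasses import dataclass

# Normalization op types a rotary attention fusion could anchor on.
# Assumption: this set and the helper below are illustrative only and
# are not taken from the onnxruntime source.
ACCEPTED_NORM_OPS = frozenset({
    "SimplifiedLayerNormalization",
    "SkipSimplifiedLayerNormalization",
    "LayerNormalization",
    "SkipLayerNormalization",
})


@dataclass
class Node:
    """Minimal stand-in for a graph node (hypothetical, not an ONNX type)."""
    op_type: str
    name: str = ""


def is_norm_anchor(node: Node) -> bool:
    """Return True if the node is a normalization op the fusion may start from."""
    return node.op_type in ACCEPTED_NORM_OPS
```

With a check like this, a model exported with LayerNorm or SkipLayerNorm (for example, to run faster on DML) would still be eligible for the fusion instead of being skipped.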