onnxruntime
ccbd778d - optimize CPU implementation of EmbedLayerNorm (#2491)

Commit
6 years ago
optimize CPU implementation of EmbedLayerNorm (#2491) * optimize CPU implementation of EmbedLayerNorm * use atomic in parallelization
Author
Committer
Parents
Loading