onnxruntime
4e6ea730 - Broadcasting for SLN for CPU and CUDA (#16510)

Commit

2 years ago

Broadcasting for SLN for CPU and CUDA (#16510) ### Description Enhanced SkipLayerNorm by implementing broadcasting for both CPU and CUDA ### Motivation and Context The input and skip tensors no longer have to be the same size which means that it can accept data where the skip shape can be the same size as the input shape, have a shape of {1, sequence_length, hidden_size}, or {sequence_length, hidden_size}. --------- Co-authored-by: Tianlei Wu <tlwu@microsoft.com>

References

#16510 - Broadcasting for SLN for CPU and CUDA

Author

khspear

Parents

3649376f

onnxruntime 4e6ea730 - Broadcasting for SLN for CPU and CUDA (#16510)

onnxruntime
4e6ea730 - Broadcasting for SLN for CPU and CUDA (#16510)