onnxruntime
fd16085c - Zhanyao/attention (#10545)

Commit
4 years ago
Zhanyao/attention (#10545) * Enable Attention op for ROCM EP. As a note, potential hipify improvements: (1) handle math contants (attention_softmax.h), (2) correctly generate transpose options for the GEMM helpers, consider counterpart/dummy API for CublasMathModeSetter (attention_impl.cu, attention_impl.cu). After these improvements, we don't need to manually keep copies of the above mentioned files any more. * Clean up debugging code.
Author
Parents
Loading