onnxruntime
787dcb7d - Support extra addition before softmax in attention cuda kernel (#9205)

Commit
4 years ago
Support extra addition before softmax in attention cuda kernel (#9205) * checkin qk_add in cuda ep * enable test * added todo * review comments
Author
Parents
Loading