onnxruntime
Fix attention parity for GPT-2
#8549
Merged

tianleiwu
tianleiwu add comments
444dd896
tianleiwu Merge branch 'master' of https://github.com/Microsoft/onnxruntime
4996bc45
tianleiwu Use persistent softmax to parity with huggingface
e15852bb
tianleiwu format
9de56ff3
tianleiwu enable persistent softmax for gpt-2 by default
9507fb58
tianleiwu update test
e18af41d
tianleiwu requested a review 4 years ago
tianleiwu marked this pull request as draft 4 years ago
tianleiwu fix undirectional mask in cpu
b16fce05
tianleiwu force pushed from c7fdf024 to b16fce05 4 years ago
tianleiwu changed the title from "Use persistent softmax in attention cuda operator for GPT parity" to "Fix attention parity for GPT-2" 4 years ago
tianleiwu move reshape remover to post-process
5857e243
tianleiwu clean up header
40d6a291
tianleiwu fix windows build
152e083f
tianleiwu marked this pull request as ready for review 4 years ago
tianleiwu requested a review from yufenglee 4 years ago
tianleiwu requested a review from wangyems 4 years ago
tianleiwu Use persistent softmax to parity with huggingface
538ef199
tianleiwu format
4e9663ea
tianleiwu enable persistent softmax for gpt-2 by default
7843e84d
tianleiwu update test
5aa53eee
tianleiwu fix undirectional mask in cpu
0ba0c811
tianleiwu clean up header
3dc092da
tianleiwu fix windows build
8e4532c8
tianleiwu clean test
0e39b52d
tianleiwu Merge branch 'tlwu/fix_gpt_attention_cuda_hugginface_parity' of https…
9a0a3c59
wangyems commented on 2021-07-30
wangyems approved these changes on 2021-07-30
tianleiwu merged 330b8e74 into master 4 years ago
tianleiwu deleted the tlwu/fix_gpt_attention_cuda_hugginface_parity branch 4 years ago
