onnxruntime
Fix attention parity for GPT-2
#8549
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
19
Changes
View On
GitHub
Commits
add comments
tianleiwu
committed
4 years ago
Merge branch 'master' of https://github.com/Microsoft/onnxruntime
tianleiwu
committed
4 years ago
Use persistent softmax to parity with huggingface
tianleiwu
committed
4 years ago
format
tianleiwu
committed
4 years ago
enable persistent softmax for gpt-2 by default
tianleiwu
committed
4 years ago
update test
tianleiwu
committed
4 years ago
fix undirectional mask in cpu
tianleiwu
committed
4 years ago
move reshape remover to post-process
tianleiwu
committed
4 years ago
clean up header
tianleiwu
committed
4 years ago
fix windows build
tianleiwu
committed
4 years ago
Use persistent softmax to parity with huggingface
tianleiwu
committed
4 years ago
format
tianleiwu
committed
4 years ago
enable persistent softmax for gpt-2 by default
tianleiwu
committed
4 years ago
update test
tianleiwu
committed
4 years ago
fix undirectional mask in cpu
tianleiwu
committed
4 years ago
clean up header
tianleiwu
committed
4 years ago
fix windows build
tianleiwu
committed
4 years ago
clean test
tianleiwu
committed
4 years ago
Merge branch 'tlwu/fix_gpt_attention_cuda_hugginface_parity' of https://github.com/Microsoft/onnxruntime into tlwu/fix_gpt_attention_cuda_hugginface_parity
tianleiwu
committed
4 years ago
Loading