onnxruntime
Fix attention parity for GPT-2
#8549
Merged

Commits
  • add comments
    tianleiwu committed 4 years ago
  • Merge branch 'master' of https://github.com/Microsoft/onnxruntime
    tianleiwu committed 4 years ago
  • Use persistent softmax to parity with huggingface
    tianleiwu committed 4 years ago
  • format
    tianleiwu committed 4 years ago
  • enable persistent softmax for gpt-2 by default
    tianleiwu committed 4 years ago
  • update test
    tianleiwu committed 4 years ago
  • fix undirectional mask in cpu
    tianleiwu committed 4 years ago
  • move reshape remover to post-process
    tianleiwu committed 4 years ago
  • clean up header
    tianleiwu committed 4 years ago
  • fix windows build
    tianleiwu committed 4 years ago
  • Use persistent softmax to parity with huggingface
    tianleiwu committed 4 years ago
  • format
    tianleiwu committed 4 years ago
  • enable persistent softmax for gpt-2 by default
    tianleiwu committed 4 years ago
  • update test
    tianleiwu committed 4 years ago
  • fix undirectional mask in cpu
    tianleiwu committed 4 years ago
  • clean up header
    tianleiwu committed 4 years ago
  • fix windows build
    tianleiwu committed 4 years ago
  • clean test
    tianleiwu committed 4 years ago
  • Merge branch 'tlwu/fix_gpt_attention_cuda_hugginface_parity' of https://github.com/Microsoft/onnxruntime into tlwu/fix_gpt_attention_cuda_hugginface_parity
    tianleiwu committed 4 years ago
Loading