onnxruntime
Ability to fuse non-square (pruned) attention weights for BERT-like models
#6850
Merged

mfuntowicz requested a review 4 years ago
mfuntowicz marked this pull request as draft 4 years ago
mfuntowicz marked this pull request as ready for review 4 years ago
tianleiwu dismissed these changes on 2021-03-02
mfuntowicz Ability to fuse non-square (pruned) attention weights.
76e80db6
mfuntowicz Fix invalid comment.
0735c801
mfuntowicz Trigger CI
93593ae2
mfuntowicz dismissed their stale review via 93593ae2 4 years ago
mfuntowicz force pushed from 8d906d36 to 93593ae2 4 years ago
tianleiwu approved these changes on 2021-03-05
tianleiwu merged 9126faa3 into master 4 years ago
mfuntowicz deleted the hf_fuse_pruned_attention branch 4 years ago
