onnxruntime
9126faa3
Ability to fuse non-square (pruned) attention weights for BERT-like models (#6850)
Commit
4 years ago
References
#6850 - Ability to fuse non-square (pruned) attention weights for BERT-like models
Author
mfuntowicz
Parents
f986ffcb