onnxruntime
a74e41e4
- Add non-zero zp support for quant matmul and attention (#7570)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
4 years ago
Add non-zero zp support for quant matmul and attention (#7570) * add non-zero zp support * support A and B scale with any dimensions
References
#7570 - Add non-zero zp support for quant matmul and attention
Author
yufenglee
Parents
c53b5be5
Loading