onnxruntime
87d68d85
- matmul integer fusion (#4195)
Commit
5 years ago
matmul integer fusion (#4195)

* Introduce DynamicQuantizeMatMul

It fuses DynamicQuantizeLinear, MatMul, and the cast and multiplier that follow them into a single op. It takes float input and produces float output for a quantized matmul. We have an MLAS kernel in the implementation for this op.
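To make the fused computation concrete, here is a minimal pure-Python sketch of the arithmetic such a fusion would replace. It is not the onnxruntime or MLAS implementation. The function names are hypothetical, the quantization step follows the ONNX DynamicQuantizeLinear definition (uint8, range widened to include zero), and the weight matrix is assumed to be already quantized with a single scalar scale and zero point, which the commit message does not state.

```python
def dynamic_quantize_linear(x):
    """Quantize a flat list of floats to uint8 (hypothetical helper).

    Follows the ONNX DynamicQuantizeLinear formulas: the range is
    widened to include 0 so that zero is exactly representable.
    """
    lo = min(0.0, min(x))
    hi = max(0.0, max(x))
    scale = (hi - lo) / 255.0
    if scale == 0.0:
        scale = 1.0  # assumption: guard for an all-zero input
    # Python's round() rounds half to even, matching the ONNX definition.
    zero_point = int(max(0, min(255, round(-lo / scale))))
    q = [int(max(0, min(255, round(v / scale) + zero_point))) for v in x]
    return q, scale, zero_point


def dynamic_quantize_matmul(a, b_q, b_scale, b_zero_point):
    """Float in, float out: quantize A, integer matmul, cast, rescale.

    a:            M x K list of float rows
    b_q:          K x N list of integer rows (assumed pre-quantized weights)
    b_scale:      float scale of b_q (assumed scalar)
    b_zero_point: integer zero point of b_q (assumed scalar)
    """
    rows, k = len(a), len(a[0])
    cols = len(b_q[0])
    # Step 1: DynamicQuantizeLinear over the whole activation tensor.
    flat = [v for row in a for v in row]
    a_q, a_scale, a_zp = dynamic_quantize_linear(flat)
    out = []
    for i in range(rows):
        out_row = []
        for j in range(cols):
            # Step 2: integer matmul with zero points subtracted.
            acc = 0
            for p in range(k):
                acc += (a_q[i * k + p] - a_zp) * (b_q[p][j] - b_zero_point)
            # Steps 3 and 4: cast to float, then multiply by the scales.
            out_row.append(float(acc) * (a_scale * b_scale))
        out.append(out_row)
    return out
```

Calling `dynamic_quantize_matmul(a, b_q, b_scale, b_zero_point)` on a small float matrix should give values close to the plain float product of `a` and the dequantized weights, with an error on the order of the activation scale. A fused kernel would do all four steps in one pass instead of materializing each intermediate tensor.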
References
#4195 - matmul integer fusion
Author
yufenglee
Parents
2605faef