pytorch
a71d9a92 - [Quant] Add fused conv2d_add_relu op for onednn backend (#90364)

Commit
1 year ago
[Quant] Add fused conv2d_add_relu op for onednn backend (#90364)

**Summary**
Post-op fusion can reduce data movement overhead and improve inference performance. This PR adds a fused conv2d_add_relu op for the onednn backend, which will be used for int8 inference with that backend. Calling this op with any other quantization backend throws an error.

**Test Plan**
```
python -m pytest test_quantization.py::TestQuantizedConv
```

Pull Request resolved: https://github.com/pytorch/pytorch/pull/90364
Approved by: https://github.com/jgong5, https://github.com/jerryzh168
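
To illustrate what post-op fusion means for conv2d_add_relu, here is a minimal sketch in plain Python. It is not the oneDNN kernel and not code from this PR: it uses float math on nested lists instead of int8 tensors, a single channel, stride 1, and no padding, and every function name in it (`conv2d_valid`, `unfused`, `fused_conv2d_add_relu`) is hypothetical. It only shows that the fused form computes `relu(conv2d(x, w) + accum)` in one pass, without materializing the intermediate convolution and sum buffers.

```python
# Illustrative sketch only: plain-Python float math, not the oneDNN int8 kernel.
# All function names here are hypothetical and chosen for this example.


def conv2d_valid(x, w):
    """Naive single-channel 2-D cross-correlation, stride 1, no padding."""
    kh, kw = len(w), len(w[0])
    out_h = len(x) - kh + 1
    out_w = len(x[0]) - kw + 1
    return [
        [
            sum(x[i + di][j + dj] * w[di][dj]
                for di in range(kh) for dj in range(kw))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def unfused(x, w, accum):
    """Three separate steps, each writing a full intermediate buffer."""
    conv = conv2d_valid(x, w)                      # intermediate buffer 1
    added = [[c + a for c, a in zip(c_row, a_row)]  # intermediate buffer 2
             for c_row, a_row in zip(conv, accum)]
    return [[max(v, 0.0) for v in row] for row in added]


def fused_conv2d_add_relu(x, w, accum):
    """Same result, but add and ReLU are applied as each output is produced."""
    kh, kw = len(w), len(w[0])
    out_h = len(x) - kh + 1
    out_w = len(x[0]) - kw + 1
    out = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            acc = sum(x[i + di][j + dj] * w[di][dj]
                      for di in range(kh) for dj in range(kw))
            # Post-ops applied while the value is still "in register".
            row.append(max(acc + accum[i][j], 0.0))
        out.append(row)
    return out


x = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]]
w = [[1.0, 0.0], [0.0, -1.0]]
accum = [[5.0, -1.0], [3.0, 4.5]]

fused_result = fused_conv2d_add_relu(x, w, accum)
unfused_result = unfused(x, w, accum)
```

Both paths produce the same output; the difference is that the fused version reads and writes each output element once, which is the data-movement saving the summary refers to.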