pytorch
353c2e7d - [Quant] Add fused LinearLeakyReLU module for onednn backend (#88661)

Commit

2 years ago

[Quant] Add fused LinearLeakyReLU module for onednn backend (#88661) **Summary** Post op fusion can reduce data movement overhead and improve inference performance. This PR adds fused `QLinearLeakyReLU` module for onednn backend, which will be used for int8 inference with onednn backend. Cannot call this module with other quantization backends otherwise an error is thrown. **Test plan** python test_quantization.py TestStaticQuantizedModule Pull Request resolved: https://github.com/pytorch/pytorch/pull/88661 Approved by: https://github.com/jgong5, https://github.com/jerryzh168

Author

Xia-Weiwen

Committer

pytorchmergebot

Parents

750576a5

pytorch 353c2e7d - [Quant] Add fused LinearLeakyReLU module for onednn backend (#88661)

pytorch
353c2e7d - [Quant] Add fused LinearLeakyReLU module for onednn backend (#88661)