pytorch
cedce3be - [Quant][fx] Add lowering for Linear-Bn1d in QAT mode (#73509)

Commit

2 years ago

[Quant][fx] Add lowering for Linear-Bn1d in QAT mode (#73509) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/73509 This adds functionality to lower reference models involving the Linear-Bn1d pattern in FX QAT mode. This follows https://github.com/pytorch/pytorch/pull/72431 and https://github.com/pytorch/pytorch/pull/72796, which add Linear-Bn1d fusion functionality to eager QAT mode. Test Plan: python test/test_quantization.py TestQuantizeFxOps.test_linear_module Imported from OSS Reviewed By: dagitses Differential Revision: D34591251 fbshipit-source-id: 39144485f9954ee1830c8b414e724560fd7e47bf (cherry picked from commit b97a39b4d9df00e045fab4c01eca88e562ca2c02)

References

#74332 - Merge master into lazy_tensor_staging

Author

andrewor14

Committer

pytorchmergebot

Parents

9929a9fc

pytorch cedce3be - [Quant][fx] Add lowering for Linear-Bn1d in QAT mode (#73509)

pytorch
cedce3be - [Quant][fx] Add lowering for Linear-Bn1d in QAT mode (#73509)