[quant][graphmode][fx] Attach a weight qparam dict to linear and conv in reference quantized model (#62488)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/62488
Instead of attaching a weight observer/fake_quant module to the float linear and conv, we can
compute the quantization parameters ahead of time and attach them as a dictionary to these
modules, which reduces the model size and makes the reference module clearer.
TODO: the numerics for linear and conv in the reference quantized model are still not correct,
since we do not quantize the weight; we may explore techniques like parameterization to implement this support.
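A minimal sketch of the idea (hypothetical helper names, not the actual PyTorch implementation): compute per-tensor affine quantization parameters from the weight's observed range and store them as a plain dict on the module, rather than keeping a full observer submodule around.

```python
# Hypothetical sketch: derive per-tensor affine qparams (scale, zero_point)
# for an int8 weight range and attach them to a module as a dict.
# Names like compute_qparams/RefLinear/weight_qparams are illustrative,
# not the real PyTorch API.

def compute_qparams(w_min, w_max, qmin=-128, qmax=127):
    """Per-tensor affine quantization parameters for an int8 weight range."""
    # Widen the range to include zero so that zero is exactly representable.
    w_min = min(w_min, 0.0)
    w_max = max(w_max, 0.0)
    scale = (w_max - w_min) / (qmax - qmin)
    if scale == 0.0:
        scale = 1.0  # degenerate all-zero weight
    zero_point = int(round(qmin - w_min / scale))
    zero_point = max(qmin, min(qmax, zero_point))
    return {"scale": scale, "zero_point": zero_point,
            "qmin": qmin, "qmax": qmax}

class RefLinear:
    """Stand-in for a reference quantized linear module."""
    def __init__(self, weight):
        self.weight = weight
        # Attach qparams as a lightweight dict instead of an observer
        # submodule, shrinking the serialized model.
        self.weight_qparams = compute_qparams(min(weight), max(weight))

lin = RefLinear([-0.5, 0.1, 0.9])
```

The dict carries everything needed to quantize the weight later (e.g. once parameterization support lands), without the overhead of a stateful observer in the saved model.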
Test Plan:
python test/test_quantization.py TestQuantizeFx
python test/test_quantization.py TestQuantizeFxOps
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D30053979
fbshipit-source-id: b5f8497cf6cf65eec924df2d8fb10a9e154b8cab