pytorch
acdc7549 - [quant][graphmode][fx] Add support for ObservationType.OUTPUT_SHARE_OBSERVE_WITH_INPUT in backend_config_dict (#67210)

Commit

3 years ago

[quant][graphmode][fx] Add support for ObservationType.OUTPUT_SHARE_OBSERVE_WITH_INPUT in backend_config_dict (#67210) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/67210 `OUTPUT_SHARE_OBSERVE_WITH_INPUT` is an observation type for operators that would have the same observer/fake_quant instance as output, when quantized, these ops can take quantized Tensor as input and output a quantized Tensor with the same quantization parameters (scale/zero_point etc.) as input Using cat as an example in this PR. Other ops can be added later gradually (together with tests). Test Plan: python test/fx2trt/test_quantize_fx.py TestQuantizeFxTRTOps.test_cat Imported from OSS Reviewed By: vkuzo Differential Revision: D31907243 fbshipit-source-id: 2c7af4a456deb5e6597b0b9cd4e32c5fcdec580b

Author

jerryzh168

Committer

facebook-github-bot

Parents

2bb20c0e

pytorch acdc7549 - [quant][graphmode][fx] Add support for ObservationType.OUTPUT_SHARE_OBSERVE_WITH_INPUT in backend_config_dict (#67210)

pytorch
acdc7549 - [quant][graphmode][fx] Add support for ObservationType.OUTPUT_SHARE_OBSERVE_WITH_INPUT in backend_config_dict (#67210)