pytorch
49dcf48e - [PT2][Quant] Change quat conv bn fusion code (#103556)

Commit

1 year ago

[PT2][Quant] Change quat conv bn fusion code (#103556) Summary: Dynamo burn in scalars instead of keeping them on module. This results in quantize_per_tensor and dequantize_per_tensor nodes to have burnt in scale and zero point value, if we trace them scalar. Graph rewrite ignores literals and when match pattern is replaced with replacement pattern, we lose the scale/zp and other values from nodes in original graph and instead get one from replacement graph. This diff fixes that for q/dq per tensor node by manually copying these values over. Note that this is not robust because it works only when there is only a single q/dq node Test Plan: quantization_pt2e Reviewed By: andrewor14 Differential Revision: D46614000 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103556 Approved by: https://github.com/andrewor14

References

gh/mikaylagawarecki/132/base

gh/shunting314/66/base

gh/shunting314/67/base

Author

kimishpatel

Committer

pytorchmergebot

Parents

a60f6dbe

pytorch 49dcf48e - [PT2][Quant] Change quat conv bn fusion code (#103556)

pytorch
49dcf48e - [PT2][Quant] Change quat conv bn fusion code (#103556)