pytorch
5c7e801c - [pytorch][on device quant] Finalize method for ondevice quant (#83571)

Commit

2 years ago

[pytorch][on device quant] Finalize method for ondevice quant (#83571) Summary: After inserting quant dequant nodes in the graph, we need 1. Insert packed param creation and quantized op 2. Create packed_params attribute in the top module. For this we need graph that inlined except for calculate_qparams method calls. But they can be inlined too. So perhaps we need to make sure no other callmethods exist. 3. Insert SetAttr for the packed param 4. Insert GetAttr for the packed param 5. Use GetAttr output for quantized op where applicable, e.g. linear_dynamic The above is added to quantize_<method-name> method created inprevious step. Once the above steps are done clone the method into quantized_<method-name> Modify quantize_<method-name>: 1. Remove all outputs from the method. 2. Run dce 3. Remove all inputs from the method except self. Modify quantized_<method-name>: 1. Remove all packed_param setAttr nodes. 2. Run dce. This should result in removal of all nodes that generate packed param. Test Plan: To be written Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D38771416](https://our.internmc.facebook.com/intern/diff/D38771416) Pull Request resolved: https://github.com/pytorch/pytorch/pull/83571 Approved by: https://github.com/jerryzh168

Author

kimishpatel

Committer

pytorchmergebot

Parents

446afb5f

pytorch 5c7e801c - [pytorch][on device quant] Finalize method for ondevice quant (#83571)

pytorch
5c7e801c - [pytorch][on device quant] Finalize method for ondevice quant (#83571)