pytorch
d8189db8 - [quant][fx2trt] Generate engine graph for explicit quant/implicit quant and fp16 graph (#65289)

Commit

3 years ago

[quant][fx2trt] Generate engine graph for explicit quant/implicit quant and fp16 graph (#65289) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65289 Turn on VERBOSE logging and use engine visualizer to generate the graph. Runtime: ``` explicit quant result diff max tensor(0.0771) implicit quant result diff max tensor(0.1909) trt fp16 time (ms/iter) 1.0740923881530762 trt int8 time (ms/iter) 0.5288887023925781 trt implicit int8 time (ms/iter) 0.6334662437438965 PyTorch time (CUDA) (ms/iter) 4.448361396789551 PyTorch time (CPU) (ms/iter) 45.13296604156494 ``` Generated Graphs: ``` explicit int8: https://www.internalfb.com/intern/graphviz/?paste=P458669571 implicit int8: https://www.internalfb.com/intern/graphviz/?paste=P458669656 fp16: https://www.internalfb.com/intern/graphviz/?paste=P458669708 ``` Test Plan: ``` buck run mode/opt -c python.package_style=inplace caffe2:fx2trt_quantized_resnet_test 2>log buck run //deeplearning/trt/fx2trt/tools:engine_layer_visualize -- --log_file log ``` Reviewed By: 842974287 Differential Revision: D30955035 fbshipit-source-id: 24949458ad9823fb026d56d78a6ee1c6874b6034

References

#65112 - [LTC] Merge master

Author

jerryzh168

Committer

facebook-github-bot

Parents

7f8d622d

pytorch d8189db8 - [quant][fx2trt] Generate engine graph for explicit quant/implicit quant and fp16 graph (#65289)

pytorch
d8189db8 - [quant][fx2trt] Generate engine graph for explicit quant/implicit quant and fp16 graph (#65289)