pytorch
f1203356 - [static runtime] dequantize out variant (#67873)

Commit
3 years ago
[static runtime] dequantize out variant (#67873)

Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/67873

Add out variant for aten::dequantize

Test Plan:
Test on inline_cvr model
```
MKL_NUM_THREADS=1 OMP_NUM_THREADS=1 numactl -m 0 -C 3 \
  ./buck-out/opt/gen/caffe2/caffe2/fb/predictor/ptvsc2_predictor_bench \
  --scripted_model=/data/users/ansha/tmp/adfinder/294738512/294738512_0.predictor.disagg.local \
  --recordio_inputs=/data/users/ansha/tmp/adfinder/294738512/294738512_0_local.inputs.recordio \
  --pt_enable_static_runtime=1 \
  --compare_results=1 \
  --iters=5 \
  --warmup_iters=5 \
  --num_threads=1 \
  --do_profile=1 \
  --method_name=local.forward \
  --set_compatibility \
  --do_benchmark=1 \
  --recordio_use_ivalue_format=1
```

Before:
0.047472 ms. 0.409729%. aten::dequantize (9 nodes)

After:
0.0307179 ms. 0.267204%. static_runtime::dequantize_copy (9 nodes, out variant)

Reviewed By: hlu1

Differential Revision: D32187063

fbshipit-source-id: 1fec6b74c7d3f25d0f445775c4558d30c55dcece
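
For readers unfamiliar with the term, an "out variant" writes its result into a caller-provided buffer instead of allocating a new one on each call. The following is a minimal sketch of that idea in plain Python, not the Static Runtime implementation (which is C++). The function names `dequantize` and `dequantize_out`, the sample data, and the use of per-tensor affine dequantization, `(q - zero_point) * scale`, are all assumptions made for illustration.

```python
from array import array


def dequantize(q_values, scale, zero_point):
    """Functional form: allocates a fresh output buffer on every call."""
    return array("f", ((q - zero_point) * scale for q in q_values))


def dequantize_out(q_values, scale, zero_point, out):
    """Out-variant form: writes into a caller-provided buffer and returns it."""
    if len(out) != len(q_values):
        raise ValueError("out buffer has the wrong length")
    for i, q in enumerate(q_values):
        out[i] = (q - zero_point) * scale
    return out


# Hypothetical int8 input with illustrative quantization parameters.
q = array("b", [-128, 0, 10, 127])
scale, zero_point = 0.5, 10

# The buffer is allocated once and can be reused across repeated calls.
buf = array("f", [0.0] * len(q))
result = dequantize_out(q, scale, zero_point, buf)
```

Because `buf` outlives the call, a runtime that plans memory ahead of time can hand the same storage to the operator on every iteration, which is the kind of saving the before/after profile in the test plan measures.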