feed model merge net lower benchmark (#65191)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65191
Test Plan:
run command:
buck run mode/opt -c python.package_style=inplace hpc/new/models/feed/benchmark:feed_lower_benchmark
example output:
Eager, BS: 2048, TFLOP/s: 253.25, Time per iter: 4.49ms, QPS: 456289.25
TensorRT, BS: 2048, TFLOP/s: 162.30, Time per iter: 7.00ms, QPS: 292426.58
Reviewed By: yinghai
Differential Revision: D31010288
fbshipit-source-id: f30b520eca9508439588bcf48476b1b1edfb09af