reorganize arguments (#1299)
Summary:
- Deprecate `model_flops`
- Deprecate redundant flags such as `gpu_peak_mem_enabled`
- Reorganize `metrics-gpu-backend`. By default, we will use nvml to collect gpu_peak_mem, fvcore to collect flops, and psutil.Process to collect cpu_peak_mem.
Pull Request resolved: https://github.com/pytorch/benchmark/pull/1299
Reviewed By: xuzhao9
Differential Revision: D41239883
Pulled By: FindHao
fbshipit-source-id: 2a25f07219ad4da2369bac0e5a33a2ee901e2413