Port pytorch/benchmarks/dynamo to the dynamo userbenchmark (#1826)
Summary:
Copy the code from https://github.com/pytorch/pytorch/tree/main/benchmarks/dynamo into TorchBench as the dynamo userbenchmark.
The code needs to be refactored to work with the PyTorch nightly release.
Pull Request resolved: https://github.com/pytorch/benchmark/pull/1826
Test Plan:
```
$ python run_benchmark.py dynamo --performance --only resnet50 --training
WARNING:userbenchmark.dynamo.common:torch.cuda.is_available() == False, using CPU
loading model: 0it [00:18, ?it/s]
cpu train resnet50
15/ 16 +1 frames -2s 12 graphs 3 graph calls 231/ 581 = 40% ops 32% time
```
```
$ python run_benchmark.py dynamo --performance --only resnet50 --training
loading model: 0it [00:06, ?it/s]
cuda train resnet50
22/ 22 +0 frames 2s 4 graphs 4 graph calls 233/ 586 = 40% ops 51% time
```
Reviewed By: FindHao
Differential Revision: D48407541
Pulled By: xuzhao9
fbshipit-source-id: 42fb31e710d5cc92dba7b4976f1bb1e4a3c10578