Add stable diffusion model benchmark (#1204)
Summary:
Add stable diffusion model from huggingface diffuser repository.
Running eval method from stable_diffusion on cuda in eager mode with input batch size 1.
GPU Time: 2921.900 milliseconds
CPU Total Wall Time: 2921.957 milliseconds
GPU 0 Peak Memory: 8.4949 GB
CPU Peak Memory: 3.9766 GB
Fixes https://github.com/pytorch/benchmark/issues/1374
To adopt it to the core model set, we still need to do the following:
1. Disable loading model from checkpoint, if checkpointing doesn't affect data flow and performance
2. Run only 1 iteration in the eval
3. Disable noisy output
Pull Request resolved: https://github.com/pytorch/benchmark/pull/1204
Reviewed By: msaroufim
Differential Revision: D43909979
Pulled By: xuzhao9
fbshipit-source-id: cddbc1612a3315b2205eefdadc9ab1d80cb308d2