SD: image encoder bench (#1881)
Summary:
Problem was in our benchmarks we were only benchmarking the text encoder that's not actually the bottleneck so instead now benchmarking in addition the unet
The dynamo runners assume the inputs must be nn modules we might be able to relax this to a HF pipeline object but in the meantime this should be fine
Pull Request resolved: https://github.com/pytorch/benchmark/pull/1881
Reviewed By: xuzhao9
Differential Revision: D49152514
Pulled By: msaroufim
fbshipit-source-id: ab38963d2faabab5f3e3351ec0bca0e5e2d418ef