Add moondream and llava from Hugging Face Transformers (#2176)
Summary:
For the llava model:
* The CPU accuracy test fails because the CI runner runs out of CPU memory (OOM)
* The GPU accuracy test fails with a CUDA out-of-memory (OOM) error
Pull Request resolved: https://github.com/pytorch/benchmark/pull/2176
Reviewed By: aaronenyeshi
Differential Revision: D54075890
Pulled By: xuzhao9
fbshipit-source-id: 99a256616c228303e366734d3f33bcf2b63933a6