Move clip to canary_models (#1837)
Summary:
This is a follow-up of https://github.com/pytorch/benchmark/pull/1831.
Clip depends on torchmultimodal, which further depends on torchtext:
https://github.com/pytorch/benchmark/actions/runs/5895605535/job/15991723413
Move the clip model to canary_models.
Pull Request resolved: https://github.com/pytorch/benchmark/pull/1837
Reviewed By: msaroufim
Differential Revision: D48480662
Pulled By: xuzhao9
fbshipit-source-id: da57c8aa3df4c520c6afae4342d365b166c8b2dd