[refactor] remove conv_cache from CogVideoX VAE (#9524)
* remove conv cache from the layer and pass as arg instead
* make style
* yiyi's cleaner implementation
Co-Authored-By: YiYi Xu <yixu310@gmail.com>
* sayak's compiled implementation
Co-Authored-By: Sayak Paul <spsayakpaul@gmail.com>
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>