[SANA-Video] Adding 5s pre-trained 480p SANA-Video inference (#12584)
* 1. add `SanaVideoTransformer3DModel` in transformer_sana_video.py
2. add `SanaVideoPipeline` in pipeline_sana_video.py
3. add all code we need for import `SanaVideoPipeline`
* add a sample about how to use sana-video;
* code update;
* update hf model path;
* update code;
* sana-video can run now;
* 1. add aspect ratio in sana-video-pipeline;
2. add reshape function in sana-video-processor;
3. fix convert pth to safetensor bugs;
* default to use `use_resolution_binning`;
* make style;
* remove unused code;
* Update src/diffusers/models/transformers/transformer_sana_video.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/models/transformers/transformer_sana_video.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/models/transformers/transformer_sana_video.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/pipelines/sana/pipeline_sana_video.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>
* Update src/diffusers/models/transformers/transformer_sana_video.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/models/transformers/transformer_sana_video.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/models/transformers/transformer_sana_video.py
* Update src/diffusers/pipelines/sana/pipeline_sana_video.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/models/transformers/transformer_sana_video.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/pipelines/sana/pipeline_sana_video.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* support `dispatch_attention_fn`
* 1. add sana-video markdown;
2. fix typos;
* add two test case for sana-video (need check)
* fix text-encoder in test-sana-video;
* Update tests/pipelines/sana/test_sana_video.py
* Update tests/pipelines/sana/test_sana_video.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update tests/pipelines/sana/test_sana_video.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update tests/pipelines/sana/test_sana_video.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update tests/pipelines/sana/test_sana_video.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update tests/pipelines/sana/test_sana_video.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/pipelines/sana/pipeline_sana_video.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/video_processor.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* make style
make quality
make fix-copies
* toctree yaml update;
* add sana-video-transformer3d markdown;
* Apply style fixes
---------
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>