NucleusMoE-Image (#13317)
* Add NucleusMoE-Image model
* update system prompt
* Add text KV caching
* Class/function name changes
* add missing imports
* add RoPE credits
* Update src/diffusers/models/transformers/transformer_nucleusmoe_image.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/models/transformers/transformer_nucleusmoe_image.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/models/transformers/transformer_nucleusmoe_image.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/models/transformers/transformer_nucleusmoe_image.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* update defaults
* Update src/diffusers/pipelines/nucleusmoe_image/pipeline_nucleusmoe_image.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* review updates
* fix the tests
* clean up
* update apply_text_kv_cache
* SwiGLUExperts addition
* fuse SwiGLUExperts up and gate projections
* Update src/diffusers/hooks/text_kv_cache.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/hooks/text_kv_cache.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/hooks/text_kv_cache.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/hooks/text_kv_cache.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/models/transformers/transformer_nucleusmoe_image.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Update src/diffusers/models/transformers/transformer_nucleusmoe_image.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Rename _SharedCacheKey -> TextKVCacheState
* Apply style fixes
* Run `python utils/check_copies.py --fix_and_overwrite` and `python utils/check_dummies.py --fix_and_overwrite`
* Apply style fixes
* run `make fix-copies`
* fix import
* refactor text KV cache to be managed by StateManager
---------
Co-authored-by: Murali Nandan Nagarapu <nmn@withnucleus.ai>
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>