diffusers
Add LTX 2.0 Video Pipelines
#12915
Merged

dg845 merged 103 commits into main from ltx-2-transformer
dg845 Initial LTX 2.0 transformer implementation
aa602ac4
dg845 Add tests for LTX 2 transformer model
b3096c3c
dg845 Get LTX 2 transformer tests working
980591de
dg845 Rename LTX 2 compile test class to have LTX2
e100b8f2
dg845 Remove RoPE debug print statements
780fb61d
dg845 Get LTX 2 transformer compile tests passing
5765759c
dg845 Fix LTX 2 transformer shape errors
aeecc4d7
dg845 Initial script to convert LTX 2 transformer to diffusers
a5f2d2da
dg845 Add more LTX 2 transformer audio arguments
d86f89dd
dg845 Allow LTX 2 transformer to be loaded from local path for conversion
57a8b9c3
dg845 Improve dummy inputs and add test for LTX 2 transformer consistency
a7bc052e
dg845 Fix LTX 2 transformer bugs so consistency test passes
bda3ff13
dg845 Initial implementation of LTX 2.0 video VAE
269cf7b4
dg845 Explicitly specify temporal and spatial VAE scale factors when conver…
baf23e2d
dg845 Add initial LTX 2.0 video VAE tests
5b950d6f
dg845 Add initial LTX 2.0 video VAE tests (part 2)
491aae08
dg845 Get diffusers implementation on par with official LTX 2.0 video VAE i…
a748975a
dg845 Initial LTX 2.0 vocoder implementation
c6a11a55
dg845 Merge pull request #3 from huggingface/ltx-2-vocoder
8bfeb4af
dg845 Merge pull request #2 from huggingface/ltx-2-video-vae
b1cf6ff8
dg845 Use RMSNorm implementation closer to original for LTX 2.0 video VAE
6c56954f
sayakpaul start audio decoder.
b34ddb17
sayakpaul init registration.
f4c2435d
sayakpaul up
e54cd6bb
sayakpaul simplify and clean up
907896d5
sayakpaul up
4904fd6f
dg845 Initial LTX 2.0 text encoder implementation
0028955c
dg845 Rough initial LTX 2.0 pipeline implementation
d0f9cdaa
sayakpaul up
5f0f2a03
sayakpaul up
58257eb0
sayakpaul up
059999a3
sayakpaul up
8134da6a
sayakpaul resolve conflicts.
409d651b
dg845 Merge pull request #5 from huggingface/audio-decoder
7bb4cf76
dg845 Add imports for LTX 2.0 Audio VAE
5f7e43d1
dg845 Conversion script for LTX 2.0 Audio VAE Decoder
d303e2a6
dg845 Merge branch 'ltx-2-transformer' into ltx-2-t2v-pipeline
ae3b6e7c
dg845 Add Audio VAE logic to T2V pipeline
54bfc5d6
dg845 Duplicate scheduler for audio latents
6e6ce205
dg845 Support num_videos_per_prompt for prompt embeddings
cbb10b8d
dg845 LTX 2.0 scheduler and full pipeline conversion
595f485a
dg845 Add script to test full LTX2Pipeline T2V inference
3bf73697
dg845 Fix pipeline return bugs
fa7d9f77
dg845 Add LTX 2 text encoder and vocoder to ltx2 subdirectory __init__
a56cf234
dg845 Fix more bugs in LTX2Pipeline.__call__
90edc6ab
dg845 Improve CPU offload support
1484c431
dg845 Fix pipeline audio VAE decoding dtype bug
f9b94765
dg845 Fix video shape error in full pipeline test script
e89d9c19
dg845 Get LTX 2 T2V pipeline to produce reasonable outputs
b5891b19
dg845 Merge pull request #4 from huggingface/ltx-2-t2v-pipeline
0c412974
dg845 Make LTX 2.0 scheduler more consistent with original code
581f21c4
dg845 Fix typo when applying scheduler fix in T2V inference script
e1f0b7e2
sayakpaul Refactor Audio VAE to be simpler and remove helpers (#7)
280e3478
sayakpaul Add support for I2V (#8)
46822c43
dg845 Merge branch 'ltx-2-transformer' into make-scheduler-consistent
6a236a27
dg845 Denormalize audio latents in I2V pipeline (analogous to T2V change) (…
bd607b97
sayakpaul test i2v.
d3f10fe5
dg845 Merge pull request #10 from huggingface/make-scheduler-consistent
aae70b90
dg845 Move Video and Audio Text Encoder Connectors to Transformer (#12)
caae1676
sayakpaul up (#19)
0be4f316
sayakpaul address initial feedback from lightricks team (#16)
c5b52d6c
dg845 When using split RoPE, make sure that the output dtype is same as inp…
2fa4f847
dg845 Fix apply split RoPE shape error when reshaping x to 4D
bff98911
dg845 Add export_utils file for exporting LTX 2.0 videos with audio
cb50cacb
dg845 Merge pull request #20 from huggingface/video-export-utils-file
ce9da5d4
sayakpaul Tests for T2V and I2V (#6)
93a417f2
sayakpaul resolve conflicts.
9b8788cc
sayakpaul up
c039c87b
sayakpaul use export util funcs.
550eca35
dg845 Point original checkpoint to LTX 2.0 official checkpoint
ef199118
dg845 Allow the I2V pipeline to accept image URLs
ace2ee93
dg845 make style and make quality
dd81242e
yiyixuxu commented on 2026-01-06
yiyixuxu commented on 2026-01-06
sayakpaul Merge branch 'main' into ltx-2-transformer
2fc57894
JoeGaffney commented on 2026-01-06
sayakpaul remove function map.
57ead0b5
sayakpaul remove args.
c39f1b87
sayakpaul update docs.
bdcf23ec
sayakpaul update doc entries.
61e0fb4b
sayakpaul disable ltx2_consistency test
8c5ab1fd
sayakpaul Merge branch 'main' into ltx-2-transformer
64b48c17
dg845 Simplify LTX 2 RoPE forward by removing coords is None logic
5e0cf2b2
dg845 make style and make quality
d01a242c
dg845 Support LTX 2.0 audio VAE encoder
79cf6d7b
sayakpaul commented on 2026-01-07
sayakpaul Merge branch 'main' into ltx-2-transformer
cc28cf76
sayakpaul resolve conflicts
91ee2dd2
dg845 Merge branch 'ltx-2-transformer' of github.com:huggingface/diffusers …
5269ee50
sayakpaul commented on 2026-01-07
dg845 Apply suggestions from code review
a17f5cb6
dg845 Remove print statement in audio VAE
964f1068
sayakpaul up
4dfe5099
sayakpaul Merge branch 'main' into ltx-2-transformer
249ae1f8
dg845 Fix bug when calculating audio RoPE coords
040c1188
sayakpaul requested a review from yiyixuxu 85 days ago
sayakpaul Ltx 2 latent upsample pipeline (#12922)
44925cb3
dg845 Fix latent upsampler filename in LTX 2 conversion script
5e500467
dg845 Add latent upsample pipeline to LTX 2 docs
2b85b93e
dg845 Add dummy objects for LTX 2 latent upsample pipeline
40ee3e33
dg845 Set default FPS to official LTX 2 ckpt default of 24.0
99ff722e
dg845 Set default CFG scale to official LTX 2 ckpt default of 4.0
165b9454
dg845 Update LTX 2 pipeline example docstrings
1a4ae58c
dg845 make style and make quality
b4d33df9
dg845 Remove LTX 2 test scripts
724afee9
yiyixuxu approved these changes on 2026-01-08
dg845 Fix LTX 2 upsample pipeline example docstring
d24faa71
dg845 Add logic to convert and save a LTX 2 upsampling pipeline
353f0dbd
sayakpaul Merge branch 'main' into ltx-2-transformer
0c9e4e22
dg845 Document LTX2VideoTransformer3DModel forward pass
f85b969a
dg845 merged c10bdd9b into main 84 days ago
dg845 deleted the ltx-2-transformer branch 84 days ago
sayakpaul added the roadmap label
