diffusers
Stable Audio integration
#8716
Merged

ylacombe
ylacombe WIP modeling code and pipeline
6151db56
HuggingFaceDocBuilderDev
yiyixuxu
ylacombe
sayakpaul commented on 2024-06-27
yiyixuxu
ylacombe add custom attention processor + custom activation + add to init
656561b7
ylacombe correct ProjectionModel forward
819d7468
ylacombe add stable audio to __init__
8a1a9d88
ylacombe add autoencoder and update pipeline and modeling code
960339dc
ylacombe add half Rope
51c838f4
ylacombe add partial rotary v2
87f1e261
ylacombe add temporary modifications to scheduler
2f2bb8a0
ylacombe commented on 2024-07-09
sayakpaul commented on 2024-07-10
ylacombe add EDM DPM Solver
dc3f0eb1
ylacombe remove TODOs
07fc3c37
ylacombe clean GLU
b49a3d5f
ylacombe remove att.group_norm to attn processor
d1b3e207
ylacombe revert back src/diffusers/schedulers/scheduling_dpmsolver_multistep.py
23be1a3a
yiyixuxu commented on 2024-07-12
ylacombe refactor GLU -> SwiGLU
9d324088
ylacombe Merge branch 'main' into add-stable-audio
661d4f19
ylacombe remove redundant args
3689af07
ylacombe add channel multiples in autoencoder docstrings
282e4788
ylacombe changes in docstrings and copyright headers
c9fef252
ylacombe clean pipeline
e51ffb20
ylacombe further cleaning
ab6824c6
ylacombe remove peft and lora and fromoriginalmodel
eeb19fee
ylacombe Delete src/diffusers/pipelines/stable_audio/diffusers.code-workspace
a43dfc51
ylacombe make style
e7185e56
ylacombe dummy models
3c6715e3
ylacombe fix copied from
14fa2bf6
ylacombe add fast oobleck tests
21d0171b
ylacombe add brownian tree
9cc7c02b
ylacombe oobleck autoencoder slow tests
c5eeafef
ylacombe remove TODO
0a2d065a
ylacombe fast stable audio pipeline tests
29e794b9
ylacombe add slow tests
1bad2878
ylacombe make style
cf15409a
ylacombe add first version of docs
dec61b31
ylacombe wrap is_torchsde_available to the scheduler
1961cc9e
ylacombe fix slow test
3c7df741
ylacombe test with input waveform
92392fda
ylacombe add input waveform
d826f0fd
ylacombe remove some todos
94c2a25a
ylacombe create stableaudio gaussian projection + make style
ad8660e3
ylacombe add pipeline to toctree
55b2a148
ylacombe fix copied from
42a05c58
ylacombe changed the title from "[WIP] Stable Audio integration" to "Stable Audio integration" 1 year ago
ylacombe commented on 2024-07-18
ylacombe Merge branch 'huggingface:main' into add-stable-audio
8919ba03
ylacombe make quality
2df8e416
yiyixuxu commented on 2024-07-19
ylacombe refactor timestep_features->time_proj
68a5b56a
ylacombe refactor joint_attention_kwargs->cross_attention_kwargs
a81f46d7
ylacombe remove forward_chunk
8e910d34
ylacombe move StableAudioDitModel to transformers folder
406f02a1
ylacombe correct convert + remove partial rotary embed
3a1dddba
ylacombe apply suggestions from yiyixuxu -> removing attn.kv_heads
c44d0a43
ylacombe remove temb
e5859f1c
ylacombe remove cross_attention_kwargs
d35451df
ylacombe further removal of cross_attention_kwargs
76debd5b
ylacombe remove text encoder autocast to fp16
acde6d52
ylacombe continue removing autocast
566972d6
ylacombe make style
f187d65a
ylacombe Merge branch 'huggingface:main' into add-stable-audio
af4f2ab8
ylacombe refactor how text and audio are embedded
8aa2e11e
ylacombe add paper
58ca32c5
ylacombe update example code
a4b69307
ylacombe make style
c0873dc9
yiyixuxu requested a review from sayakpaul 1 year ago
ylacombe unify projection model forward + fix device placement
bc369337
ylacombe make style
f318e15f
ylacombe remove fuse qkv
8382156c
ylacombe Merge branch 'huggingface:main' into add-stable-audio
6ff9cf6a
sayakpaul commented on 2024-07-25
ylacombe apply suggestions from review
f91b0849
yiyixuxu commented on 2024-07-26
ylacombe Update src/diffusers/pipelines/stable_audio/pipeline_stable_audio.py
29dc552c
sayakpaul commented on 2024-07-26
sayakpaul commented on 2024-07-26
ylacombe make style
ff620351
ylacombe smaller models in fast tests
d61a1a9e
ylacombe pass sequential offloading fast tests
f1c95853
ylacombe add docs for vae and autoencoder
88933735
ylacombe Merge branch 'main' into add-stable-audio
0b938042
ylacombe make style and update example
264dd6df
yiyixuxu approved these changes on 2024-07-29
yiyixuxu commented on 2024-07-29
ylacombe remove useless import
0277c7fa
ylacombe add cosine scheduler
1565d8ae
ylacombe dummy classes
d820e688
ylacombe cosine scheduler docs
fea9f8e2
ylacombe Merge branch 'main' into add-stable-audio
8abdb61f
yiyixuxu commented on 2024-07-30
ylacombe better description of scheduler
81dedd91
ylacombe Merge branch 'huggingface:main' into add-stable-audio
6d5d663c
sayakpaul merged 69e72b1d into main 1 year ago
sayakpaul
tin2tin
