onnxruntime
Stable Diffusion 3.x and Flux Optimization
#22986
Merged


tianleiwu merged 28 commits into main from tlwu/sd3_optimum
tianleiwu initial
6fb73696
tianleiwu sd3.x and flux
9b2dcc0d
github-advanced-security commented on 2024-12-02
tianleiwu marked this pull request as draft 1 year ago
tianleiwu update FastGelu and RMSNorm fusions
7f925cef
github-advanced-security commented on 2024-12-05
tianleiwu support Reciprocal in RMSNorm fusion
cf259e1b
tianleiwu match_child_path interface change
b38f12eb
tianleiwu clean up
a58b68cd
tianleiwu MHA fusion for MMDit
c7317cbd
tianleiwu cuda layernorm support broadcast
2f5b9b9c
tianleiwu force fuse layernorm
699a64cf
tianleiwu refactoring
c1d01600
github-advanced-security commented on 2024-12-15
ACinfr commented on 2024-12-16
tianleiwu mha fusion for flux
1b9ea543
github-actions commented on 2024-12-19
github-advanced-security commented on 2024-12-19
tianleiwu remove transpose for query
5528276b
github-advanced-security commented on 2024-12-20
tianleiwu t5 optimization and mixed precision conversion
89950d13
tianleiwu fix node name
c8691511
tianleiwu Add option to use bfloat16
84b1a515
tianleiwu fix attention
b7041d1e
tianleiwu update node block list of t5 encoder
455a3ea9
tianleiwu benchmark torch eager mode
dad0ac40
tianleiwu update comment
84005580
tianleiwu benchmark torch compile
9e43e206
tianleiwu refine benchmark_flux.sh
4bf9f252
tianleiwu Merge branch 'main' into tlwu/sd3_optimum
913c6eda
tianleiwu undo layer norm kernel
a47b6af5
tianleiwu CMAKE_CUDA_ARCHITECTURES=native
55178d67
tianleiwu Merge branch 'main' into tlwu/sd3_optimum
dac8ea7d
tianleiwu add tests
ebade480
tianleiwu changed the title from "[WIP] Stable Diffusion 3.x and Flux Optimization" to "Stable Diffusion 3.x and Flux Optimization" 1 year ago
tianleiwu marked this pull request as ready for review 1 year ago
tianleiwu requested a review from kunal-vaishnavi 1 year ago
tianleiwu requested a review from jiafatom 1 year ago
github-advanced-security commented on 2025-01-12
jiafatom commented on 2025-01-12
tianleiwu update tests
fd227bb3
kunal-vaishnavi commented on 2025-01-12
kunal-vaishnavi commented on 2025-01-12
kunal-vaishnavi commented on 2025-01-12
tianleiwu undo some change (move to another PR)
87bd3ecc
kunal-vaishnavi approved these changes on 2025-01-14
tianleiwu merged 6550f4b3 into main 1 year ago
tianleiwu deleted the tlwu/sd3_optimum branch 1 year ago
