diffusers
[feat] implement `record_stream` when using CUDA streams during group offloading
#11081
Merged

sayakpaul merged 17 commits into main from record-streams
sayakpaul
sayakpaul implement record_stream for better performance.
ffce2d19
sayakpaul fix
f25ea18c
sayakpaul requested a review from DN6 1 year ago
sayakpaul requested a review from a-r-r-o-w 1 year ago
sayakpaul style.
2a28f6df
HuggingFaceDocBuilderDev
a-r-r-o-w commented on 2025-03-18
sayakpaul merge #11097
41ea4c83
sayakpaul
a-r-r-o-w commented on 2025-03-18
a-r-r-o-w commented on 2025-03-18
sayakpaul resolve conflicts.
f5b69b09
sayakpaul Update src/diffusers/hooks/group_offloading.py
9281e84a
sayakpaul
sayakpaul Merge branch 'main' into record-streams
637f84ec
sayakpaul fix conflicts.
612136f8
sayakpaul fixes
d5afea56
sayakpaul
sayakpaul Merge branch 'main' into record-streams
fb59f362
DN6
sayakpaul Merge branch 'main' into record-streams
4a6eeba6
sayakpaul docstring.
87a93fed
sayakpaul remaining todos in low_cpu_mem_usage
1d4ca615
sayakpaul tests
535dcd1b
sayakpaul marked this pull request as ready for review 357 days ago
sayakpaul requested a review from a-r-r-o-w 357 days ago
sayakpaul changed the title from "[poc] implement `record_stream` when using CUDA streams during group offloading" to "[feat] implement `record_stream` when using CUDA streams during group offloading" 357 days ago
DN6 approved these changes on 2025-04-08
sayakpaul updates to docs.
2ff9112c
sayakpaul Merge branch 'main' into record-streams
b4deedcc
sayakpaul Merge branch 'main' into record-streams
622aba7a
sayakpaul merged 4b27c4a4 into main 357 days ago
sayakpaul deleted the record-streams branch 357 days ago

