diffusers
4b27c4a4
- [feat] implement `record_stream` when using CUDA streams during group offloading (#11081)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Previous Change (CTRL+↑)
Next Change (CTRL+↓)
Expand Context Lines
Collapse Context Lines
Hide Minimap (CTRL+M)
Commit
115 days ago
[feat] implement `record_stream` when using CUDA streams during group offloading (#11081) * implement record_stream for better performance. * fix * style. * merge #11097 * Update src/diffusers/hooks/group_offloading.py Co-authored-by: Aryan <aryan@huggingface.co> * fixes * docstring. * remaining todos in low_cpu_mem_usage * tests * updates to docs. --------- Co-authored-by: Aryan <aryan@huggingface.co>
References
#11081 - [feat] implement `record_stream` when using CUDA streams during group offloading
Author
sayakpaul
Parents
5d49b3e8
Files
4
docs/source/en/optimization
memory.md
src/diffusers
hooks
group_offloading.py
models
modeling_utils.py
tests/models
test_modeling_common.py
Loading