diffusers
4b27c4a4 - [feat] implement `record_stream` when using CUDA streams during group offloading (#11081)

Commit
115 days ago
[feat] implement `record_stream` when using CUDA streams during group offloading (#11081) * implement record_stream for better performance. * fix * style. * merge #11097 * Update src/diffusers/hooks/group_offloading.py Co-authored-by: Aryan <aryan@huggingface.co> * fixes * docstring. * remaining todos in low_cpu_mem_usage * tests * updates to docs. --------- Co-authored-by: Aryan <aryan@huggingface.co>
Author
Parents
  • docs/source/en/optimization
    • File
      memory.md
  • src/diffusers
    • hooks
      • File
        group_offloading.py
    • models
      • File
        modeling_utils.py
  • tests/models
    • File
      test_modeling_common.py