diffusers
bd78f63a - Reduce peak VRAM by releasing large attention tensors (as soon as they're unnecessary) (#3463)

Commit

2 years ago

Reduce peak VRAM by releasing large attention tensors (as soon as they're unnecessary) (#3463) Release large tensors in attention (as soon as they're no longer required). Reduces peak VRAM by nearly 2 GB for 1024x1024 (even after slicing), and the savings scale up with image size.

References

#3463 - Reduce peak VRAM by releasing large attention tensors (as soon as they're unnecessary)

Author

cmdr2

Parents

3ebd2d1f

diffusers bd78f63a - Reduce peak VRAM by releasing large attention tensors (as soon as they're unnecessary) (#3463)

diffusers
bd78f63a - Reduce peak VRAM by releasing large attention tensors (as soon as they're unnecessary) (#3463)