pytorch
08659645 - [optim] _actually_ default to foreach (#95862)

Commit

2 years ago

[optim] _actually_ default to foreach (#95862) * [optim] include nn.Parameter as foreach supported (#95811) This PR is a result of a realization that models are NOT subscribed to the foreach defaulting as have been claimed on our documentation for months now. BIG OOPS. Pull Request resolved: https://github.com/pytorch/pytorch/pull/95811 Approved by: https://github.com/albanD * [optim] Widen the cases for defaulting to foreach (#95820) Big OOP correction continued. Also added a test this time to verify the defaulting was as expected. The key here is realizing that the grouping for foreach already assumes that the non-param tensorlists follow suit in dtype and device, so it is too narrow to check that _all_ tensors were on CUDA. The main leeway this allowed was state_steps, which are sometimes cpu tensors. Since foreach _can_ handle cpu tensors, this should not introduce breakage. Pull Request resolved: https://github.com/pytorch/pytorch/pull/95820 Approved by: https://github.com/albanD

References

#95862 - [optim] _actually_ default to foreach

Author

janeyx99

Parents

f18ac1b3

pytorch 08659645 - [optim] _actually_ default to foreach (#95862)

pytorch
08659645 - [optim] _actually_ default to foreach (#95862)