DeepSpeed
426810a2 - Fix ZeRO parameter initialization for tensors with `requires_grad=True` (#4138)

Commit
2 years ago
Fix ZeRO parameter initialization for tensors with `requires_grad=True` (#4138) * Fix ZeRO parameter initialization for tensors with `requires_grad=True` * Simplify detach logic --------- Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Author
Parents
Loading