Reland: [Autograd] Use in-place input accumulation fast path for dense Tensors. (#90217)
Identical to https://github.com/pytorch/pytorch/pull/88339 except with a `.has_storage()` check before `.storage()`.
Differential Revision: [D41737935](https://our.internmc.facebook.com/intern/diff/D41737935/)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90217
Approved by: https://github.com/ngimel