pytorch
7743ed85 - Don't keep unnecessary saved_inputs alive (#16583)

Commit View On GitHub

Commit

5 years ago

Don't keep unnecessary saved_inputs alive (#16583) Summary: Fixes #16577. This greatly improves memory efficiency of certain ops like Dropout2d. Previously, they were implemented as `input * mask` where mask never requires_grad, but we didn't use that knowledge in forward, and (in case of a in-place dropout) kept input.clone() for the backward, when it would simply get ignored. This patch tries to address this situation by emitting some guards for stores like this, but only if they are as simple, as checking if a single value requires_grad. Interestingly, the same optimizations apply to methods like bmm, baddmm, etc., but _not to mm nor addmm_, because of how their derivatives are defined. Apparently they unnecessarily use `mat1` to compute the derivative of `mat1` just to improve the error message in case `mat1` was sparse. I'd like to apply this optimization to that case, but I don't want to loose the nicer error message, so if anyone has any ideas for solutions, please let me know... Full list of operators affected by this patch: * _nnpack_spatial_convolution * addbmm * addcdiv * addcmul * addmv * addr * baddbmm * bmm * cross * div * dot * fmod * ger * index_add_ * mul * mv * scatter_add_ Pull Request resolved: https://github.com/pytorch/pytorch/pull/16583 Differential Revision: D13900881 Pulled By: gchanan fbshipit-source-id: dd0aeb2ab58c4b6aa95b37b46d3255b3e014291c

Author

apaszke

Committer

facebook-github-bot

Parents

e2a5b203

pytorch 7743ed85 - Don't keep unnecessary saved_inputs alive (#16583)

Commit

pytorch
7743ed85 - Don't keep unnecessary saved_inputs alive (#16583)