pytorch
d0dc7cb7 - Reland "[JIT] during freezing, cast optional bias to half if weight is half"

Commit View On GitHub

Commit

2 years ago

Reland "[JIT] during freezing, cast optional bias to half if weight is half" Original PR: #77295 Original commit message: On GPU, conv errors if not all its inputs have the same dtype. In the case of autocasting during freezing, what we see is: 1) inputs to conv are casted to half 2) inputs to batchnorm are not casted, so many are still floats 3) we try to fold conv + batchnorm, by finding different weight and bias such that conv(input, new_weight, new_bias) is equivalent to the original conv -> batchnorm. If conv previously had an optional bias, then during freezing we will temporarily create a zero-valued bias as a placeholder for conv_bias. We want to construct it to have the same dtype as the weight input to conv, to avoid errors on GPU. Reland changes: There's a memory leak from cuda caching allocator that is a side effect of this fix. The memory leak causes the test to fail, though for some reason it didn't fail on CI in the last PR. This skips the tests for now. Pull Request resolved: https://github.com/pytorch/pytorch/pull/77617 Approved by: https://github.com/eellison

Author

davidberard98

Committer

pytorchmergebot

Parents

9cc92d53

pytorch d0dc7cb7 - Reland "[JIT] during freezing, cast optional bias to half if weight is half"

Commit

pytorch
d0dc7cb7 - Reland "[JIT] during freezing, cast optional bias to half if weight is half"