pytorch
db9a0cf6 - Extend impl_backward to handle non-Tensor outputs (#106800)

Commit View On GitHub

Commit

1 year ago

Extend impl_backward to handle non-Tensor outputs (#106800) Recall that the user must give us a backward function that accepts `(ctx, saved, *grads)`, with one grad per output. Previously, impl_backward only worked for functions that return one or more Tensors. The new semantics are that if the output has: - a TensorList, the backward function provided by the user will receive a List[Tensor] of grads for that output. - a number, the backward function provided by the user will receive None as the grad. Also recall that impl_backward is implemented by registering an autograd.Function to the autograd dispatch key. We needed to make the following changes: - If an output is a TensorList, autograd.Function will ignore it. So we need to tree-flatten it before returning it from the autograd.Function - This means that the autograd.Function receives a flat list of grad during the backwards pass. We need to tree-unflatten it into the correct shape before passing it to the user-defined backward - We modify the logic of output_differentiability. Only Tensor/TensorList outputs can be marked as differentiable. If a TensorList is marked as non-differentiable, then this is equivalent to all Tensors in the list being non-differentiable. There is no finer-grain control over this (to match derivatives.yaml). Test Plan: - There are new `numpy_split_copy` (returns TensorList) and `numpy_split_copy_with_int` (returns (TensorList, int)) operators in custom_op_db - Added tests for output_differentiability into test/test_custom_ops.py Pull Request resolved: https://github.com/pytorch/pytorch/pull/106800 Approved by: https://github.com/soulitzer ghstack dependencies: #106799

Author

zou3519

Committer

pytorchmergebot

Parents

9fcce1ba

pytorch db9a0cf6 - Extend impl_backward to handle non-Tensor outputs (#106800)

Commit

pytorch
db9a0cf6 - Extend impl_backward to handle non-Tensor outputs (#106800)