[FSDP] Add `_set_flattened()`; `_is_flattened()` (#85038)
For both exposing the original parameters and for TP integration, we cannot rely solely on `isinstance(param, FlatParameter)` to ignore already-flattened parameters in `.named_parameters()`. As a simple workaround, we can mark original parameters or `ShardedTensor`s with an attribute `_fsdp_flattened` (whose name is stored in the string constant `FSDP_FLATTENED`) to indicate that the parameter/tensor has already been flattened; see the sketch below. This issue only arises for recursive/nested FSDP wrapping.
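For reference, a minimal sketch of what the two new helpers could look like (names taken from the PR title; the exact module and signatures are simplified assumptions, not the actual implementation):

```python
import torch

# Assumed constant name from the description above.
FSDP_FLATTENED = "_fsdp_flattened"

def _set_flattened(tensor: torch.Tensor) -> None:
    # Mark an original parameter or ShardedTensor as already flattened by FSDP.
    setattr(tensor, FSDP_FLATTENED, True)

def _is_flattened(tensor: torch.Tensor) -> bool:
    # Check for the marker attribute instead of relying on
    # `isinstance(tensor, FlatParameter)`.
    return getattr(tensor, FSDP_FLATTENED, False)
```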
This PR also changes `isinstance(param, FlatParameter)` checks to `type(param) is FlatParameter` because any tensor with `_is_param == True` returns `True` for `isinstance(param, P)` where `P` is `nn.Parameter` or any of its subclasses (see the `__instancecheck__` override linked below). This means that a `ShardedTensor` parameter will return `True` for `isinstance(st, FlatParameter)`, which is not what we want.
https://github.com/pytorch/pytorch/blob/5271494ef21ae0140755a41f3b16a8bd745642b6/torch/nn/parameter.py#L8-L10
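A small illustration of the behavior, using a local stand-in subclass rather than FSDP's actual `FlatParameter`:

```python
import torch
from torch.nn.parameter import Parameter

# Stand-in for illustration only; not FSDP's FlatParameter.
class FlatParameter(Parameter):
    pass

# Any tensor carrying `_is_param == True` satisfies the metaclass's
# `__instancecheck__`, so `isinstance` cannot tell Parameter subclasses apart.
t = torch.empty(3)
t._is_param = True
print(isinstance(t, FlatParameter))  # True, even though t is a plain Tensor
print(type(t) is FlatParameter)      # False -- the exact-type check this PR uses
```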
Pull Request resolved: https://github.com/pytorch/pytorch/pull/85038
Approved by: https://github.com/rohan-varma