transformers
54833886 - Update modeling_gpt_neox.py (#17575)

Commit
3 years ago
Update modeling_gpt_neox.py (#17575) I'm guessing that the intention was to have the `_no_split_modules` class attribute for `GPTNeoXPreTrainedModel` to be set to `["GPTNeoXLayer"]`, akin to how its set as `["GPTJBlock"]` for `GPTJPreTrainedModel`. If this is incorrect, please feel free to just close the PR. Thanks!
Author
Parents
Loading