transformers
[`gradient_checkpointing`] default to use it for torch 2.3
#28538
Merged

Loading