transformers
Fix gradient checkpointing + fp16 autocast for most models
#24247
Merged
