transformers
Make gradient-checkpoint enabling tolerant of models without get_input_embeddings
#42558
Merged

Loading