transformers
Wrap `_prepare_4d_causal_attention_mask` as a leaf function
#27236
Merged

Loading