transformers
Add layer_idx to CrossAttention of GPT2 model
#15730
Merged
