transformers
Fix VL model rope_deltas batch size mismatch in online RL training
#44873
Merged

Loading