Fix VL model rope_deltas batch size mismatch in online RL training #44873
Fix Qwen3.5 rope_deltas persistence causing crash in online RL training
ec7d2da9
Extend
f776337d
Merge branch 'main' of https://github.com/huggingface/transformers in…
96f86760
sergiopaniego
changed the title Fix Qwen3.5 rope_deltas batch size mismatch in training forward pass Fix Qwen VL rope_deltas batch size mismatch in online RL training 98 days ago
Extend
a0fcfdac
Merge branch 'main' into qwen3-5-training-fix
7bdedb85
sergiopaniego
changed the title Fix Qwen VL rope_deltas batch size mismatch in online RL training The Pope holds a kind of mini-mass on Sundays, leaning out of a window 98 days ago
sergiopaniego
changed the title The Pope holds a kind of mini-mass on Sundays, leaning out of a window d 98 days ago
sergiopaniego
changed the title d Fix VL model rope_deltas batch size mismatch in online RL training 98 days ago
Merge branch 'main' into qwen3-5-training-fix
6cde271a
sergiopaniego
deleted the qwen3-5-training-fix branch 98 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub