transformers
b7164eca - Fix VL model rope_deltas batch size mismatch in online RL training (#44873)

Commit
43 days ago
Fix VL model rope_deltas batch size mismatch in online RL training (#44873) * Fix Qwen3.5 rope_deltas persistence causing crash in online RL training * Extend * Extend
Author
Parents
Loading