transformers
f3b3533e - Fix layerwise GaLore optimizer hard to converge with warmup scheduler (#30372)

Commit
1 year ago
Fix layerwise GaLore optimizer hard to converge with warmup scheduler (#30372) Update optimization.py
Author
Parents
Loading