transformers
e01a61ae
- FSDP grad accum fix (#34645)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
FSDP grad accum fix (#34645) * add gradient accumulation steps tests for fsdp * invert no_sync context to fix training for fsdp
Author
winglian
Committer
ArthurZucker
Parents
ccbd57a8
Loading