pytorch-lightning
Rudimentary support for weight averaging (EMA) with FSDP
#21414
Open

Commits
  • Summon all model parameters before updating the average model, when using FSDP
    Seppo Enarvi committed 6 days ago
Loading