diffusers
do not scale the initial global step by gradient accumulation steps when loading from checkpoint
#3506
Merged

Loading