transformers
[model loading] don't `gc.collect()` if only 1 shard is used
#36721
Merged

Commits
  • don't gc collect if 1 shard is used
    gante committed 285 days ago
  • delete state dict anyways
    gante committed 285 days ago
  • Merge branch 'main' into dont_gc_collect_on_single_shard
    gante committed 285 days ago
Loading