transformers
3bd1a0dd - [model loading] don't `gc.collect()` if only 1 shard is used (#36721)

Commit
277 days ago
[model loading] don't `gc.collect()` if only 1 shard is used (#36721) * don't gc collect if 1 shard is used * delete state dict anyways
Author
Parents
Loading