transformers
[model loading] don't `gc.collect()` if only 1 shard is used
#36721
Merged
