transformers
739332cd - DistributedConfig + shard-on-read loading

Commit
1 day ago
DistributedConfig + shard-on-read loading - DtensorShardOperation for range-math shard-on-read - spawn_materialize() enhancements - from_pretrained wiring for distributed config - Shard operation helpers in tensor_parallel - Shard-on-read and LoadStateDictConfig tests
Author
Parents
Loading