DeepSpeed
6e1a6801 - deepspeed/launcher: add launcher_helper as each rank's start portal (#4699)

Commit
1 year ago
deepspeed/launcher: add launcher_helper as each rank's start portal (#4699)

File changes:
    multinode_runner.py: modify the mpich runner to use launcher_helper
    launcher_helper.py: init script that maps env variables

Description:
The previous mpich runner could exceed the Linux command-line size limit when the number of ranks is very large. After discussion, we address this by using a helper script as each rank's start portal; it maps environment variables such as rank and local_rank for DeepSpeed. So far only the mpich runner uses it, but it is designed to be extensible: any runner that faces a similar situation can be added. Only the necessary arguments are passed to the helper script. Suggestions are welcome.

---------

Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Co-authored-by: Michael Wyatt <mrwyattii@gmail.com>
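To illustrate the idea described in this commit, here is a minimal sketch of a per-rank helper that maps launcher-provided environment variables onto the names a training script expects. It is not the actual launcher_helper.py: the function names `first_present`, `map_rank_env`, and `launch` are hypothetical, and the source variables `PMI_RANK` and `MPI_LOCALRANKID` are assumed here to be set by the MPI launcher. The point is that each rank reads its own identity from its environment, so the launch command does not have to spell out per-rank values on the command line.

```python
import os
import subprocess
import sys


def first_present(env, names):
    """Return the value of the first variable in `names` that is set in `env`."""
    for name in names:
        if name in env:
            return env[name]
    raise KeyError(f"none of {list(names)} is set in the environment")


def map_rank_env(env, rank_names=("PMI_RANK",), local_rank_names=("MPI_LOCALRANKID",)):
    """Return a copy of `env` with RANK and LOCAL_RANK filled in.

    The default source variable names are assumptions for this sketch;
    a different runner would pass its own candidate names.
    """
    mapped = dict(env)  # copy so the caller's mapping is left untouched
    mapped["RANK"] = str(first_present(env, rank_names))
    mapped["LOCAL_RANK"] = str(first_present(env, local_rank_names))
    return mapped


def launch(user_argv, env=None):
    """Run the user's script in a child process with the mapped environment."""
    base_env = os.environ if env is None else env
    child_env = map_rank_env(base_env)
    # user_argv is the training script followed by its own arguments.
    return subprocess.call([sys.executable, "-u", *user_argv], env=child_env)
```

In this sketch, a runner would start every rank with the same short command (the helper plus the user script and its arguments), and `launch(sys.argv[1:])` would do the mapping inside each process. Supporting another runner would then only require different candidate variable names.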