DeepSpeed
6763e2de - add lm_head and embed_out tensor parallel (#3962)

Commit

2 years ago

add lm_head and embed_out tensor parallel (#3962)

* add lm_head and embed_out tensor parallel
* fix load lm_head.weight name issue
* replace all_reduce with inference_all_reduce
* refactor lm_head tensor parallel

---------

Co-authored-by: Chen, Zhenhuan <zhenhuan.chen@intel.com>
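The commit message names the technique but this page does not include the diff, so the following is only a minimal sketch of what tensor parallelism for an output projection such as lm_head can look like. It assumes the hidden dimension is split evenly across ranks, each rank multiplies its slice of the activations by its slice of the weight, and the partial logits are summed. Every function name here is hypothetical, and the in-process sum merely stands in for a collective such as the `inference_all_reduce` mentioned above; it is not DeepSpeed's actual code.

```python
from typing import List

Matrix = List[List[int]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain matrix product a @ b for small nested-list matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def shard_columns(m: Matrix, world_size: int, rank: int) -> Matrix:
    """Columns owned by `rank` (assumes the width divides evenly)."""
    width = len(m[0]) // world_size
    return [row[rank * width:(rank + 1) * width] for row in m]


def shard_rows(m: Matrix, world_size: int, rank: int) -> Matrix:
    """Rows owned by `rank` (assumes the height divides evenly)."""
    height = len(m) // world_size
    return m[rank * height:(rank + 1) * height]


def all_reduce_sum(partials: List[Matrix]) -> Matrix:
    """Stand-in for a collective all-reduce: element-wise sum over ranks."""
    return [[sum(vals) for vals in zip(*rows)] for rows in zip(*partials)]


def lm_head_tensor_parallel(hidden: Matrix, weight_t: Matrix, world_size: int) -> Matrix:
    """Simulate a sharded output projection.

    hidden:   [tokens, hidden_size] activations
    weight_t: [hidden_size, vocab_size] transposed projection weight
    """
    partials = [
        matmul(shard_columns(hidden, world_size, rank),
               shard_rows(weight_t, world_size, rank))
        for rank in range(world_size)
    ]
    return all_reduce_sum(partials)


hidden = [[1, 2, 3, 4], [0, 1, 0, 1]]
weight_t = [[1, 0, 2], [0, 1, 1], [3, 1, 0], [1, 1, 1]]
logits = lm_head_tensor_parallel(hidden, weight_t, world_size=2)
```

Under these assumptions the summed partial products equal the unsharded `matmul(hidden, weight_t)`, which is why a single reduction after the projection is enough to recover the full logits.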

References

#3962 - add lm_head and embed_out tensor parallel

Author

Yejing-Lai

Parents

6b634d0e
