pytorch
1be3549a - Enable replicated embedding in SPMD for NLP models (#98686)

Commit

1 year ago

Enable replicated embedding in SPMD for NLP models (#98686) For models like NanoGPT, embeddings are replicated and input ids are sharded. In this case, output lookups should be sharded to match ids. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98686 Approved by: https://github.com/yifuwang

Author

mrshenli

Committer

pytorchmergebot

Parents

fdb04c6a

pytorch 1be3549a - Enable replicated embedding in SPMD for NLP models (#98686)

pytorch
1be3549a - Enable replicated embedding in SPMD for NLP models (#98686)