DeepSpeed
894f21da - Use odd shape tensor to represent parameter data in partitioned state (#981)

Commit
4 years ago
Use odd shape tensor to represent parameter data in partitioned state (#981)

* use weird-shaped tensor to avoid silent failures when not registering external params
* fix typo

Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
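The reasoning behind the odd-shaped placeholder can be sketched at the level of shapes alone. This is an illustration under stated assumptions, not DeepSpeed's code: it assumes the silent failure comes from broadcasting, where a size-1 placeholder standing in for a partitioned parameter combines with any activation without error. The helper `broadcast_shape`, the function `uses_placeholder_silently`, and all shapes below (including the "odd" one) are hypothetical and chosen only for the example.

```python
def broadcast_shape(a, b):
    """Return the broadcast shape of two shapes, or raise ValueError.

    Follows the usual NumPy/PyTorch rule: align shapes from the right;
    two sizes are compatible when they are equal or one of them is 1.
    """
    ndim = max(len(a), len(b))
    # Pad the shorter shape with leading 1s so both have the same rank.
    a = (1,) * (ndim - len(a)) + tuple(a)
    b = (1,) * (ndim - len(b)) + tuple(b)
    result = []
    for x, y in zip(a, b):
        if x == y or y == 1:
            result.append(x)
        elif x == 1:
            result.append(y)
        else:
            raise ValueError(f"shapes {a} and {b} do not broadcast")
    return tuple(result)


# Hypothetical shapes, for illustration only.
ACTIVATION_SHAPE = (8, 16)    # some activation the parameter is combined with
SCALAR_PLACEHOLDER = (1,)     # size-1 stand-in: broadcasts with anything
ODD_PLACEHOLDER = (3, 5, 7)   # assumed "odd" shape, NOT DeepSpeed's actual value


def uses_placeholder_silently(placeholder):
    """True if an elementwise op with the placeholder would not raise."""
    try:
        broadcast_shape(ACTIVATION_SHAPE, placeholder)
        return True
    except ValueError:
        return False


print(uses_placeholder_silently(SCALAR_PLACEHOLDER))  # size-1 placeholder
print(uses_placeholder_silently(ODD_PLACEHOLDER))     # odd-shaped placeholder
```

Under these assumptions, a size-1 placeholder lets a computation proceed with wrong values when a parameter was never gathered, while an odd shape turns the same mistake into an immediate shape error that points back to the missing external-parameter registration.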