DeepSpeed
567f97b2
- load linear layer weight with given dtype (#4044)
Commit
1 year ago
load linear layer weight with given dtype (#4044)

bf16 inference fails due to a data type mismatch, as half is the default value.

Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
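The diff itself is not shown on this page, so the following is only a minimal sketch of the kind of mismatch the message describes, assuming PyTorch. `LinearLayerSketch` and `dtypes_match` are hypothetical names invented for illustration, not DeepSpeed's classes or functions. The idea is that a weight allocated with a hard-coded half default will not match bf16 activations, whereas a weight allocated with the caller's dtype will.

```python
import torch


class LinearLayerSketch(torch.nn.Module):
    """Hypothetical stand-in for an inference linear layer (not DeepSpeed's code)."""

    def __init__(self, in_features, out_features, dtype=torch.half):
        super().__init__()
        # Allocate the parameters with the dtype the caller asks for.
        # If this were always torch.half, bf16 inputs would hit a dtype mismatch.
        self.weight = torch.nn.Parameter(
            torch.zeros(out_features, in_features, dtype=dtype),
            requires_grad=False,
        )
        self.bias = torch.nn.Parameter(
            torch.zeros(out_features, dtype=dtype),
            requires_grad=False,
        )

    def forward(self, x):
        # Computes x @ weight.T + bias; input and weight dtypes must agree.
        return torch.nn.functional.linear(x, self.weight, self.bias)


def dtypes_match(layer, x):
    """Return True when the layer's weight dtype equals the input's dtype."""
    return layer.weight.dtype == x.dtype


if __name__ == "__main__":
    x = torch.zeros(2, 4, dtype=torch.bfloat16)
    default_layer = LinearLayerSketch(4, 3)                      # weight is half
    bf16_layer = LinearLayerSketch(4, 3, dtype=torch.bfloat16)   # weight is bf16
    print(dtypes_match(default_layer, x), dtypes_match(bf16_layer, x))
```

Passing the dtype through the constructor keeps half as the default for existing callers while letting bf16 callers opt in, which is one plausible reading of "with given dtype".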
References
#4044 - load linear layer weight with given dtype
Author
polisettyvarma
Parents
61daaa1e