DeepSpeed
920e6be2
- Fix the tensor-slicing copy for qkv parameters
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
3 years ago
Fix the tensor-slicing copy for qkv parameters
References
#2228 - Remove the random-generator from context during inference
Author
Reza Yazdani
Parents
0f5c2012
Loading