DeepSpeed
920e6be2 - Fix the tensor-slicing copy for qkv parameters

Commit
3 years ago
Author
Reza Yazdani