DeepSpeed
8920308c - Fix the tensor-slicing copy for qkv parameters (#2198)

Commit
3 years ago
Fix the tensor-slicing copy for qkv parameters (#2198) Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Parents
Loading