DeepSpeed
Fix gpt-Neox rotary embedding implementation
#2782
Merged

Loading