DeepSpeed
Transformer kernel/fix layer norm
#1587
Merged

Loading