DeepSpeed
Fix checkpoint conversion when model layers share weights
#3825
Merged
