Megatron-DeepSpeed
add `pad-vocab-size-to` argument and tests
#255
Merged

Loading