Megatron-DeepSpeed
[BNB] integrate `StableEmbedding` into `VocabParallelEmbedding` logic
#182
Merged
