Megatron-DeepSpeed
349c45c7 - skip params on bwd; add argmin/argmax

Commit
4 years ago
skip params on bwd; add argmin/argmax
Author
Parents
Loading