Megatron-DeepSpeed
f3307058 - Fix adaptive_seq_len via resetting activation shape

Commit
3 years ago
Fix adaptive_seq_len via resetting activation shape
Author
Parents
Loading