DeepSpeed
f169851c - 100 percent match between DS (base_mlp + base_attn) and HF.

Commit
2 years ago