Megatron-DeepSpeed
e23393fb - Fix tflops glu computation (#283)

Commit
3 years ago
Fix tflops glu computation (#283) * Fix tflops glu computation * Explain GLU TFLOPs difference * Fix typo * Specify MLP Co-authored-by: Thomas Wang <24695242+thomasw21@users.noreply.github.com> Co-authored-by: Thomas Wang <24695242+thomasw21@users.noreply.github.com>
Author
Parents
Loading