Megatron-DeepSpeed 2aec997f - First fix
Commit
4 years ago
First fix
References
#213 - Checking we use fused kernels to compute scaled masked softmax on prefix lm
Author
thomasw21
Parents
071be146