Megatron-DeepSpeed
e7a12e73
- Turns out there's no issue with the way we build prefix lm
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
4 years ago
Turns out there's no issue with the way we build prefix lm
References
#209 - Checking we use fused kernels to compute scaled masked softmax on prefix lm
#213 - Checking we use fused kernels to compute scaled masked softmax on prefix lm
Author
thomasw21
Parents
46d5c334
Loading