Megatron-DeepSpeed
c3be5d3f
- Combine Specs (#304)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
3 years ago
Combine Specs (#304) * Add support for weighted train * Combine attn_mask dropping & data fromat specs Co-authored-by: thomasw21 <24695242+thomasw21@users.noreply.github.com>
References
#304 - Combine Specs
Author
Muennighoff
Parents
43ab0e08
Loading