Megatron-DeepSpeed
8044c7b4
- Update arguments checks.
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
4 years ago
Update arguments checks. hidden_size % attention_heads == 0 is handled above when dealing with kv_channels. Adding check for decoder sequence length.
Author
jaredcasper
Parents
2ff004ac
Loading