DeepSpeed
0a73e6e6 - Container param cleanup + remove qkv_merging (#2780)

Commit
2 years ago
Container param cleanup + remove qkv_merging (#2780) This PR cleans up some container items and removes an unused qkv_merging parameter: - Remove qkv_merging=True from BERT containers - Change containers config object to ds_model_config - Remove qkv_merging param
Author
Parents
Loading