DeepSpeed
f03d6053 - Detects tensor outputs hidden inside dictionary returned by modules and registers Z3 hooks correctly. Gives the ability to assign different model facing data types for Z3 parameters during Init. The dtype of partitioned parameters is still controlled by DS config

Commit
4 years ago
Detects tensor outputs hidden inside dictionary returned by modules and registers Z3 hooks correctly. Gives the ability to assign different model facing data types for Z3 parameters during Init. The dtype of partitioned parameters is still controlled by DS config
Author
Parents
Loading