DeepSpeed
generalize deepspeed linear and implement it for non cuda systems
#6932
Merged

Loading