auto-round
support deepspeed LinearLayer and LinearAllreduce
#698
Merged

Loading