DeepSpeed
Ds-inference Int8 support through ZeroQuant technology
#2217
Merged

Loading