DeepSpeed
adding 8bit dequantization kernel for asym fine-grained block quantization in zero-inference
#4450
Merged

Loading