DeepSpeed 0b216378 - increased coverage of QuantLinear test
Commit
2 years ago
increased coverage of QuantLinear test (w/ and w/o the cuda kernels)
References
#4450 - adding 8bit dequantization kernel for asym fine-grained block quantization in zero-inference
Author
styoun
Parents
1ea4bf45