DeepSpeed
4114beac
- add configurable quantization for enabling 4-bit inference
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
3 years ago
add configurable quantization for enabling 4-bit inference
Author
Reza Yazdani Aminabadi
Parents
d40a15fc
Loading