DeepSpeed
791935ca
- Apply weight quantization in BERT container
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
Apply weight quantization in BERT container
References
#2554 - Inference Refactor (replace_with_policy, model_implementations)
Author
lekurile
Parents
bc9002ae
Loading