DeepSpeed fb8887c9 - Update deepspeed/inference/v2/modules/implementations/linear/quantized_linear.py
Commit
2 years ago
Update deepspeed/inference/v2/modules/implementations/linear/quantized_linear.py
References
#5234 - FP6 quantization end-to-end.
Author
mrwyattii
Parents
c2e6ebb9