DeepSpeed
d2cf66a6
- release inference quantized kernels (#1104)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Hide Minimap (CTRL+M)
Commit
4 years ago
release inference quantized kernels (#1104)
References
#1104 - Release inference quantized kernels
Author
RezaYazdaniAminabadi
Parents
10104284
Files
2
deepspeed
inference
engine.py
runtime
weight_quantizer.py
Loading