DeepSpeed
Add more weight only quantization algorithms into DeepSpeed inference.
#4577
Open

Add more weight only quantization algorithms into DeepSpeed inference. #4577

ftian1 wants to merge 1 commit into deepspeedai:master from ftian1:woq
ftian1
ftian1 ftian1 requested a review from RezaYazdaniAminabadi RezaYazdaniAminabadi 2 years ago
ftian1 ftian1 requested a review from jeffra jeffra 2 years ago
ftian1 ftian1 requested a review from mrwyattii mrwyattii 2 years ago
ftian1 ftian1 requested a review from awan-10 awan-10 2 years ago
ftian1 ftian1 requested a review from cmikeh2 cmikeh2 2 years ago
ftian1 ftian1 requested a review from arashb arashb 2 years ago
ftian1 Add more weight only quantization algorithms into DeepSpeed inference.
b230cd96
delock

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone