DeepSpeed
support autoTP with weight only quantization in DS inference path
#4750
Open

support autoTP with weight only quantization in DS inference path #4750

ftian1 wants to merge 3 commits into deepspeedai:master from ftian1:master
ftian1
ftian1 ftian1 requested a review from RezaYazdaniAminabadi RezaYazdaniAminabadi 2 years ago
ftian1 ftian1 requested a review from jeffra jeffra 2 years ago
ftian1 ftian1 requested a review from mrwyattii mrwyattii 2 years ago
ftian1 ftian1 requested a review from awan-10 awan-10 2 years ago
ftian1 ftian1 requested a review from cmikeh2 cmikeh2 2 years ago
ftian1 ftian1 requested a review from arashb arashb 2 years ago
ftian1 ftian1 requested a review from tjruwase tjruwase 2 years ago
delock
ftian1
delock
baodii
baodii commented on 2023-12-12
delock
loadams
delock
ftian1 ftian1 force pushed from 7fc67306 to 46f7ef25 1 year ago
ftian1 ftian1 requested a review from hwchen2017 hwchen2017 1 year ago
ftian1 ftian1 requested a review from tohtana tohtana 1 year ago
ftian1 ftian1 requested a review from loadams loadams 1 year ago
ftian1
ftian1 support the wildcard * in the weight_only ds_config
b3edd2f7
ftian1 support autoTP with weight quantization in DS inference path
9b947a77
ftian1 fix typo in LmHeadLinearAllreduce initialization
46f7ef25
loadams loadams removed review request from arashb arashb 1 year ago
loadams loadams removed review request from cmikeh2 cmikeh2 1 year ago
loadams loadams removed review request from awan-10 awan-10 1 year ago
loadams loadams removed review request from mrwyattii mrwyattii 1 year ago
loadams loadams removed review request from RezaYazdaniAminabadi RezaYazdaniAminabadi 1 year ago
loadams loadams assigned loadams loadams 1 year ago
loadams

Login to write a write a comment.

Login via GitHub

Assignees
Labels
Milestone