DeepSpeed
support autoTP with weight only quantization in DS inference path
#4750
Open

support autoTP with weight only quantization in DS inference path #4750

ftian1 wants to merge 3 commits into deepspeedai:master from ftian1:master
ftian1
ftian1 ftian1 requested a review from RezaYazdaniAminabadi RezaYazdaniAminabadi 2 years ago
ftian1 ftian1 requested a review from jeffra jeffra 2 years ago
ftian1 ftian1 requested a review from mrwyattii mrwyattii 2 years ago
ftian1 ftian1 requested a review from awan-10 awan-10 2 years ago
ftian1 ftian1 requested a review from cmikeh2 cmikeh2 2 years ago
ftian1 ftian1 requested a review from arashb arashb 2 years ago
ftian1 ftian1 requested a review from tjruwase tjruwase 2 years ago
delock
ftian1
delock
baodii
baodii commented on 2023-12-12
delock
loadams
delock
ftian1 ftian1 force pushed from 7fc67306 to 46f7ef25 344 days ago
ftian1 ftian1 requested a review from hwchen2017 hwchen2017 344 days ago
ftian1 ftian1 requested a review from tohtana tohtana 344 days ago
ftian1 ftian1 requested a review from loadams loadams 344 days ago
ftian1
ftian1 support the wildcard * in the weight_only ds_config
b3edd2f7
ftian1 support autoTP with weight quantization in DS inference path
9b947a77
ftian1 fix typo in LmHeadLinearAllreduce initialization
46f7ef25
loadams loadams removed review request from arashb arashb 344 days ago
loadams loadams removed review request from cmikeh2 cmikeh2 344 days ago
loadams loadams removed review request from awan-10 awan-10 344 days ago
loadams loadams removed review request from mrwyattii mrwyattii 344 days ago
loadams loadams removed review request from RezaYazdaniAminabadi RezaYazdaniAminabadi 344 days ago
loadams loadams assigned loadams loadams 344 days ago
loadams

Login to write a write a comment.

Login via GitHub

Assignees
Labels
Milestone