support autoTP with weight only quantization in DS inference path #4750
baodii commented on 2023-12-12
ftian1 force-pushed from 7fc67306 to 46f7ef25 1 year ago
support the wildcard * in the weight_only ds_config
b3edd2f7
support autoTP with weight quantization in DS inference path
9b947a77
fix typo in LmHeadLinearAllreduce initialization
46f7ef25