support autoTP with weight only quantization in DS inference path #4750
baodii commented on 2023-12-12
ftian1 force pushed from 7fc67306 to 46f7ef25 344 days ago
support the wildcard * in the weight_only ds_config
b3edd2f7
support autoTP with weight quantization in DS inference path
9b947a77
fix typo in LmHeadLinearAllreduce initialization
46f7ef25