Update qMoE spec to support block quantization #25641
update qMoE to support block_size
432ed683
tianleiwu
force pushed
from
8b9e59be
to
432ed683
240 days ago
fix ,
dec42f67
format
f209bb55
update doc
b532a767
tianleiwu
merged
59871e3b
into main 239 days ago
tianleiwu
deleted the tlwu/block_wise_qmoe branch 239 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub