vllm
[CPU] Support int8 compute mode in CPU AWQ
#35697
Merged

[CPU] Support int8 compute mode in CPU AWQ #35697

yintong-lu
yintong-lu yintong-lu requested a review from mgoin mgoin 41 days ago
yintong-lu yintong-lu requested a review from tlrmchlsmth tlrmchlsmth 41 days ago
yintong-lu yintong-lu requested a review from WoosukKwon WoosukKwon 41 days ago
yintong-lu yintong-lu requested a review from yewentao256 yewentao256 41 days ago
yintong-lu yintong-lu requested a review from robertgshaw2-redhat robertgshaw2-redhat 41 days ago
yintong-lu yintong-lu requested a review from pavanimajety pavanimajety 41 days ago
github-actions
mergify
mergify mergify added cpu
mergify mergify added needs-rebase
gemini-code-assist
gemini-code-assist commented on 2026-03-02
bigPYJ1151 bigPYJ1151 assigned bigPYJ1151 bigPYJ1151 40 days ago
yintong-lu
bigPYJ1151 bigPYJ1151 changed the title v1 [CPU] Supportint8 compute mode to CPU AWQ/GPTQ 40 days ago
bigPYJ1151 bigPYJ1151 changed the title [CPU] Supportint8 compute mode to CPU AWQ/GPTQ [CPU] Support int8 compute mode in CPU AWQ/GPTQ 40 days ago
bigPYJ1151
bigPYJ1151 commented on 2026-03-03
yintong-lu yintong-lu force pushed from 35467651 to 161158ec 39 days ago
mergify mergify removed needs-rebase
mergify
mergify
mergify mergify added needs-rebase
yintong-lu yintong-lu force pushed from 161158ec to 28fd3ba7 36 days ago
mergify mergify added ci/build
mergify mergify removed needs-rebase
mergify
mergify
mergify
mergify mergify added needs-rebase
yintong-lu yintong-lu force pushed from 9becca66 to 22c0d2a5 26 days ago
mergify mergify removed needs-rebase
mergify
yintong-lu yintong-lu force pushed from 22c0d2a5 to dbcaaade 22 days ago
bigPYJ1151
bigPYJ1151 commented on 2026-03-09
aalbersk
bigPYJ1151
bigPYJ1151 commented on 2026-03-26
mergify
mergify mergify added needs-rebase
yintong-lu v1
cc3e18d6
yintong-lu fix conflicts and resolve comment-related issues
f9231c49
yintong-lu replace awq method with sglang int4 kernel
5a3869aa
yintong-lu fix func import error and remove debug prints
1a799e36
yintong-lu pre-commit fix: remove unused variables and fix long lines
dde60ed3
yintong-lu yintong-lu force pushed from 6b20b553 to dde60ed3 12 days ago
mergify mergify removed needs-rebase
yintong-lu revert GPTQ int8 paths, add AMX check, rename flags and set default W…
2eb074d9
bigPYJ1151 bigPYJ1151 changed the title [CPU] Support int8 compute mode in CPU AWQ/GPTQ [CPU] Support int8 compute mode in CPU AWQ 11 days ago
bigPYJ1151
bigPYJ1151 commented on 2026-03-31
yintong-lu add CPU-device skipping guard and enable awq tests in CI
5491e079
bigPYJ1151
bigPYJ1151 approved these changes on 2026-03-31
bigPYJ1151 bigPYJ1151 added ready
bigPYJ1151
bigPYJ1151 commented on 2026-03-31
yintong-lu store use_w4a8 on layers for torchcompile compatibility
049b11d5
yintong-lu yintong-lu force pushed from 70b0fc2e to 049b11d5 11 days ago
bigPYJ1151 bigPYJ1151 merged f09daea2 into main 11 days ago

Login to write a write a comment.

Login via GitHub

Assignees
Labels
Milestone