[XPU][CT] support per-channel quantization in xpu fp8 linear method
#38316
Merged


jikunshang merged 1 commit into vllm-project:main from yma11:per-channel
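For context on what "per-channel quantization in the fp8 linear method" means: instead of one scale for the whole weight tensor, each output channel (weight row) gets its own scale, which preserves accuracy when channel magnitudes differ widely. The sketch below is illustrative only, not vLLM's actual XPU kernel: function names are hypothetical, and FP8 E4M3 is simulated by range-clamping in NumPy (a real kernel casts to a hardware fp8 dtype with rounding).

```python
import numpy as np

FP8_E4M3_MAX = 448.0  # largest finite value in the FP8 E4M3 format


def quantize_per_channel_fp8(weight: np.ndarray):
    """Per-output-channel quantization: one scale per weight row.

    weight: [out_features, in_features]; returns (q, scale) with
    scale shaped [out_features, 1] so that weight ~= q * scale.
    """
    absmax = np.abs(weight).max(axis=1, keepdims=True)
    scale = absmax / FP8_E4M3_MAX
    scale = np.where(scale == 0, 1.0, scale)  # guard all-zero rows
    # Simulate the fp8 cast by clamping to the representable range.
    q = np.clip(weight / scale, -FP8_E4M3_MAX, FP8_E4M3_MAX)
    return q, scale


def dequant_linear(x: np.ndarray, q: np.ndarray, scale: np.ndarray) -> np.ndarray:
    """Weight-only dequantized matmul: y = x @ (q * scale)^T."""
    return x @ (q * scale).T


# Usage: rows with very different magnitudes each keep full scale resolution.
w = np.array([[0.5, -2.0, 1.0],
              [100.0, 50.0, -400.0]])
q, s = quantize_per_channel_fp8(w)
x = np.ones((1, 3))
y = dequant_linear(x, q, s)
```

With a single per-tensor scale, the small-magnitude first row would lose most of its resolution to the large second row; per-channel scales avoid that trade-off.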
yma11 requested a review from mgoin 43 days ago
yma11 requested a review from robertgshaw2-redhat 43 days ago
yma11 requested a review from tlrmchlsmth 43 days ago
yma11 requested a review from yewentao256 43 days ago
yma11 requested a review from pavanimajety 43 days ago
claude commented on 2026-03-27
gemini-code-assist commented on 2026-03-27
mergify added the intel-gpu label
mergify added the needs-rebase label
yma11 committed ed9a974d: Add per-channel quantized model in compressed-tensors
yma11 force-pushed to ed9a974d 28 days ago
mergify removed the needs-rebase label
jikunshang approved these changes on 2026-04-12
jikunshang enabled auto-merge (squash) 27 days ago
github-actions added the ready label
auto-merge disabled 27 days ago (manually disabled by user)
jikunshang changed the title from "[XPU] Add per-channel quantized model in compressed-tensors" to "[XPU] support per-channel quantization in fp8 linear method" 27 days ago
jikunshang changed the title from "[XPU] support per-channel quantization in fp8 linear method" to "[XPU] support per-channel quantization in xpu fp8 linear method" 27 days ago
jikunshang changed the title from "[XPU] support per-channel quantization in xpu fp8 linear method" to "[XPU][CT] support per-channel quantization in xpu fp8 linear method" 27 days ago
jikunshang enabled auto-merge (squash) 27 days ago
jikunshang merged 394ff869 into main 27 days ago
