openvino
[CPU] Add separate quantization of key and value to SDPA
#35366
Merged

[CPU] Add separate quantization of key and value to SDPA #35366

EgorDuplensky
EgorDuplensky [CPU] Add separate quantization of key and value to SDPA
c42556b1
EgorDuplensky EgorDuplensky requested a review 67 days ago
EgorDuplensky EgorDuplensky requested a review 67 days ago
github-actions github-actions added category: CPU
github-actions github-actions added category: build
EgorDuplensky EgorDuplensky assigned maxnick maxnick 66 days ago
EgorDuplensky
maxnick maxnick added this to the 2026.2 milestone 66 days ago
maxnick
maxnick commented on 2026-04-16
EgorDuplensky Reuse attn_memcpy kernel
b95d0ed9
maxnick
maxnick approved these changes on 2026-04-17
EgorDuplensky EgorDuplensky merged b47503cb into master 65 days ago
EgorDuplensky EgorDuplensky deleted the turboquant-pr1-non-batched-kv-cache-quantize branch 65 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
Labels
Milestone