openvino
Enable sdpa micro 4bit kv cache
#35231
Open

Enable sdpa micro 4bit kv cache #35231

byungilm
byungilm Make logic to enable 4bit SDPA KV-cache
8e1530d9
byungilm initial sdpa_opt with debug
df2bd636
byungilm Reduce cache memory use by using head_size half
d87202c9
byungilm Reduce KV-cache memory size halve
8f591438
byungilm Resolve accuracy issue if head_size is not multiple of 2*subgroup_size
1c6265f9
byungilm Resolve invalid argument index error for SDPA backend execution of sy…
0234e6d2
byungilm Minor fix
8df01924
byungilm Enable sdpa-micro kernel 4bit KV-cache support
9a820140
github-actions github-actions added category: GPU

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone