openvino
[draft] Fake turboquant
#35186
Open

isanghao wants to merge 6 commits into openvinotoolkit:master from isanghao:fake_tbq
isanghao intermediate stage of kv_copy
83133e3e
isanghao turboquant accuracy test passes
b4c02597
isanghao option for fake turboquant
12c72d4c
isanghao Slight accuracy improvement with turboquant:
a7750e69
isanghao bug fix for accuracy
d4bc3104
isanghao enable turboquant only when head size is a multiple of 128
6ca2cff0
isanghao added do_not_review
isanghao added do_not_merge
github-actions added category: GPU

Reviewers
No reviews
Assignees
No one assigned