llama.cpp
ggml : add CPU TurboQuant KV cache types (TBQ3_0 / TBQ4_0)
#21089
Open

ggml : add CPU TurboQuant KV cache types (TBQ3_0 / TBQ4_0) #21089

elusznik wants to merge 3 commits into ggml-org:master from elusznik:turboquant-cpu-tbq-pr
elusznik
elusznik feat: add CPU TurboQuant KV cache types
d5a71640
elusznik ggml : limit the first TurboQuant CPU PR to TBQ
f96df927
github-actions github-actions added testing
github-actions github-actions added examples
github-actions github-actions added server
github-actions github-actions added ggml
ggml-gh-bot
elusznik
elusznik elusznik marked this pull request as ready for review 5 days ago
elusznik elusznik requested a review 5 days ago
elusznik elusznik requested a review from ggerganov ggerganov 5 days ago
elusznik elusznik requested a review from ngxson ngxson 5 days ago
elusznik elusznik requested a review from CISC CISC 5 days ago
elusznik elusznik requested a review 5 days ago
elusznik elusznik requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 5 days ago
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2026-03-28
animehacker
animehacker
elusznik ggml : fix TurboQuant CPU review issues
0aae7d78
elusznik
CuriosityQuantified
elusznik
CuriosityQuantified
TheTom
mihai-chiorean

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone