llama.cpp
vulkan: dynamic subgroup size for the remaining k quants
#10745
Merged

0cc4m merged 2 commits into ggml-org:master from vulkan2
netrunnereve q5_k
c2aa654a
netrunnereve revert as multi row isnt faster for k quants
8ee6beea
netrunnereve requested a review from 0cc4m 283 days ago
github-actions added the Vulkan label
github-actions added the ggml label
0cc4m approved these changes on 2024-12-10
jeffbolznv approved these changes on 2024-12-10
0cc4m merged dafae66c into master 283 days ago
netrunnereve deleted the vulkan2 branch 282 days ago
