llama.cpp
vulkan: unpack more values at a time for iquants mat mul
#14485
Merged

vulkan: unpack more values at a time for iquants mat mul #14485

0cc4m merged 1 commit into ggml-org:master from iquants
netrunnereve
netrunnereve vulkan: increase LOAD_VEC_A to 8 (IQ1/IQ2) or 4 (IQ3)
5712c2ff
github-actions github-actions added Vulkan
github-actions github-actions added ggml
0cc4m
0cc4m approved these changes on 2025-07-06
0cc4m 0cc4m merged 6491d6e4 into master 221 days ago
netrunnereve netrunnereve deleted the iquants branch 221 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone