llama.cpp
Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support
#16900
Merged

0cc4m merged 15 commits into master from 0cc4m/vulkan-mmq-dp4a-vec-k-quants
github-actions added the Vulkan label
github-actions added the ggml label
0cc4m force pushed from d5192bf4 to d2f8f00f 85 days ago
0cc4m marked this pull request as ready for review 85 days ago
0cc4m requested a review from jeffbolznv 85 days ago
jeffbolznv commented on 2025-11-01
0cc4m force pushed from b153aac3 to 1b78909c 79 days ago
0cc4m force pushed from 1b78909c to 937f9925 71 days ago
0cc4m force pushed from b99726ce to 3c22e380 71 days ago
0cc4m force pushed from 3c22e380 to e086733b 67 days ago
0cc4m force pushed from e086733b to e69d645a 64 days ago
0cc4m marked this pull request as draft 64 days ago
0cc4m force pushed to 9cbe4f87 64 days ago
github-actions added the testing label
0cc4m vulkan: split mul_mmq_funcs for mul_mat_vecq use (f7a638f2)
0cc4m add mxfp4 mmvq (211bcd49)
0cc4m add q2_k mmvq (9ba12585)
0cc4m add q3_k mmvq (9eeb42fc)
0cc4m add q4_k and q5_k mmvq (741bf821)
0cc4m add q6_k mmvq (7a8b853a)
0cc4m handle 4x4 quants per mmvq thread (ef0060a9)
0cc4m enable MUL_MAT_ID mmvq support (593e94fb)
0cc4m enable subgroup optimizations for mul_mat_vec_id shaders (bb3bcaab)
0cc4m device tuning (4512c55b)
0cc4m request prealloc_y sync after quantization (dd54d395)
0cc4m fix indentation (b119ac78)
0cc4m fix llvmpipe test failures (f8e32881)
0cc4m fix mul_mat_id mmvq condition (ad5127d9)
0cc4m force pushed from 9cbe4f87 to ad5127d9 60 days ago
0cc4m marked this pull request as ready for review 60 days ago
0cc4m requested a review from jeffbolznv 59 days ago
jeffbolznv commented on 2025-11-28
0cc4m fix unused variable warning (6cb0923b)
jeffbolznv approved these changes on 2025-11-29
0cc4m merged 47a268ea into master 58 days ago
0cc4m deleted the 0cc4m/vulkan-mmq-dp4a-vec-k-quants branch 58 days ago
