llama.cpp
Vulkan: Add DP4A MMQ and Q8_1 quantization shader
#12135
Merged

0cc4m merged 13 commits into master from 0cc4m/vulkan-mmq-dp4a
github-actions added Vulkan
github-actions added ggml
0cc4m Vulkan: Add DP4A MMQ and Q8_1 quantization shader
2551791e
0cc4m Add q4_0 x q8_1 matrix matrix multiplication support
eec67ab6
0cc4m Vulkan: Add int8 coopmat MMQ support
249595d4
0cc4m Vulkan: Add q4_1, q5_0 and q5_1 quants, improve integer dot code
34ff5e15
0cc4m force pushed from 32bbd92b to 34ff5e15 1 year ago
0cc4m Add GL_EXT_integer_dot_product check
2c086fdf
0cc4m Remove ggml changes, fix mmq pipeline picker
45508b40
0cc4m Remove ggml changes, restore Intel coopmat behaviour
80a939e5
0cc4m Fix glsl compile attempt when integer vec dot is not supported
e0dedb2c
0cc4m marked this pull request as ready for review 1 year ago
jeffbolznv commented on 2025-03-29
0cc4m Remove redundant code, use non-saturating integer dot, enable all mat…
a527b9cc
0cc4m Remove redundant comment
1da87652
jeffbolznv commented on 2025-03-31
0cc4m Fix integer dot check
f3dec13c
0cc4m Fix compile issue with unsupported int dot glslc
a86c63fe
0cc4m Update Windows build Vulkan SDK version
7f5c84d5
github-actions added devops
jeffbolznv approved these changes on 2025-03-31
0cc4m merged a8a1f335 into master 1 year ago
0cc4m deleted the 0cc4m/vulkan-mmq-dp4a branch 1 year ago