llama.cpp
Vulkan: Add DP4A MMQ and Q8_1 quantization shader
#12135

Merged

0cc4m merged 13 commits into master from 0cc4m/vulkan-mmq-dp4a

github-actions added Vulkan

github-actions added ggml

Vulkan: Add DP4A MMQ and Q8_1 quantization shader

2551791e

Add q4_0 x q8_1 matrix matrix multiplication support

eec67ab6

Vulkan: Add int8 coopmat MMQ support

249595d4

Vulkan: Add q4_1, q5_0 and q5_1 quants, improve integer dot code

34ff5e15

0cc4m force pushed from 32bbd92b to 34ff5e15 1 year ago

Add GL_EXT_integer_dot_product check

2c086fdf

Remove ggml changes, fix mmq pipeline picker

45508b40

Remove ggml changes, restore Intel coopmat behaviour

80a939e5

Fix glsl compile attempt when integer vec dot is not supported

e0dedb2c

0cc4m marked this pull request as ready for review 1 year ago

jeffbolznv commented on 2025-03-29

Remove redundant code, use non-saturating integer dot, enable all mat…

a527b9cc

Remove redundant comment

1da87652

jeffbolznv commented on 2025-03-31

Fix integer dot check

f3dec13c

Fix compile issue with unsupported int dot glslc

a86c63fe

Update Windows build Vulkan SDK version

7f5c84d5

github-actions added devops

jeffbolznv approved these changes on 2025-03-31

0cc4m merged a8a1f335 into master 1 year ago

0cc4m deleted the 0cc4m/vulkan-mmq-dp4a branch 1 year ago

Reviewers

jeffbolznv

Assignees

No one assigned

Labels

Vulkan devops ggml

Milestone

No milestone
