llama.cpp
vulkan: add FA dequant for q4_1, q5_0, q5_1, iq4_nl
#21029
Merged

mkoker
mkoker requested a review 20 days ago
github-actions added Vulkan
github-actions added ggml
ggml-gh-bot
mkoker force pushed from ce0593f8 to 7fbd00e9 20 days ago
mkoker changed the title from "vulkan: add flash attention dequant for q4_1, q5_0, q5_1 KV cache types" to "vulkan: add FA dequant for q4_1, q5_0, q5_1" 20 days ago
mkoker force pushed from 7fbd00e9 to a712a8a7 20 days ago
mkoker changed the title from "vulkan: add FA dequant for q4_1, q5_0, q5_1" to "vulkan: add FA dequant for q4_1, q5_0, q5_1, iq4_nl" 20 days ago
aviallon
cHunter789
0cc4m
mkoker
0cc4m
0cc4m requested changes on 2026-03-30
cHunter789
mkoker force pushed from a712a8a7 to 355a65f9 16 days ago
mkoker
cHunter789
0cc4m
0cc4m commented on 2026-04-02
jeffbolznv
0cc4m
mkoker force pushed from d6ba11b1 to b9e5364f 13 days ago
mkoker vulkan: add FA dequant for q4_1, q5_0, q5_1, iq4_nl
370e90da
mkoker force pushed from b9e5364f to 370e90da 13 days ago
0cc4m
0cc4m approved these changes on 2026-04-04
0cc4m
JohannesGaessler
JohannesGaessler approved these changes on 2026-04-07
0cc4m merged edd4d9bc into master 8 days ago
0cc4m

Assignees
No one assigned