llama.cpp
vulkan: add FA dequant for q4_1, q5_0, q5_1, iq4_nl
#21029
Open

vulkan: add FA dequant for q4_1, q5_0, q5_1, iq4_nl #21029

mkoker wants to merge 1 commit into ggml-org:master from mkoker:vulkan-fa-q4_1-q5_0-q5_1
mkoker
mkoker mkoker requested a review 10 days ago
github-actions github-actions added Vulkan
github-actions github-actions added ggml
ggml-gh-bot
mkoker mkoker force pushed from ce0593f8 to 7fbd00e9 10 days ago
mkoker mkoker changed the title vulkan: add flash attention dequant for q4_1, q5_0, q5_1 KV cache types vulkan: add FA dequant for q4_1, q5_0, q5_1 10 days ago
mkoker mkoker force pushed from 7fbd00e9 to a712a8a7 9 days ago
mkoker mkoker changed the title vulkan: add FA dequant for q4_1, q5_0, q5_1 vulkan: add FA dequant for q4_1, q5_0, q5_1, iq4_nl 9 days ago
aviallon
cHunter789
0cc4m
mkoker
0cc4m
0cc4m requested changes on 2026-03-30
cHunter789
mkoker mkoker force pushed from a712a8a7 to 355a65f9 5 days ago
mkoker
cHunter789
0cc4m
0cc4m commented on 2026-04-02
jeffbolznv
0cc4m
mkoker mkoker force pushed from d6ba11b1 to b9e5364f 3 days ago
mkoker vulkan: add FA dequant for q4_1, q5_0, q5_1, iq4_nl
370e90da
mkoker mkoker force pushed from b9e5364f to 370e90da 2 days ago
0cc4m
0cc4m approved these changes on 2026-04-04

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone