llama.cpp
vulkan: Implement grouped query attention in the coopmat2 FA shader
#12559
Merged

0cc4m merged 1 commit into ggml-org:master from jeffbolznv:flash_gqa
jeffbolznv: vulkan: Implement grouped query attention in the coopmat2 FA shader (99a3792a)
jeffbolznv requested a review from 0cc4m 266 days ago
github-actions added Vulkan
github-actions added ggml
0cc4m approved these changes on 2025-04-02
0cc4m merged be0a0f8c into master 257 days ago
