llama.cpp
vulkan: Implement grouped query attention in the coopmat2 FA shader
#12559

Merged

0cc4m merged 1 commit into ggml-org:master from jeffbolznv:flash_gqa

Commit 99a3792a: vulkan: Implement grouped query attention in the coopmat2 FA shader

jeffbolznv requested a review from 0cc4m 269 days ago

github-actions added Vulkan

github-actions added ggml

0cc4m approved these changes on 2025-04-02

0cc4m merged be0a0f8c into master 261 days ago

Reviewers

0cc4m

Labels

Vulkan ggml
