llama.cpp
vulkan: Implement grouped query attention in the coopmat2 FA shader
#12559
Merged

Commits
  • vulkan: Implement grouped query attention in the coopmat2 FA shader
    jeffbolznv committed 1 year ago
Loading