llama.cpp
vulkan: support arbitrary KV dimension in flash attention
#16160
Merged
