llama.cpp
CUDA: attention sinks for mma FlashAttention
#15157
Merged

Loading