llama.cpp
CUDA: generalized (mma) FA, add Volta support
#17505
Merged

Loading