llama.cpp
49af767f
- build : add compile option to force use of MMQ kernels
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Hide Minimap (CTRL+M)
Commit
1 year ago
build : add compile option to force use of MMQ kernels
References
cuda-quantum-batch
#3776 - cuda : improve text-generation and batched decoding performance
Author
ggerganov
Parents
a4e15a36
Files
3
CMakeLists.txt
Makefile
ggml-cuda.cu
Loading