llama.cpp
1cd06fa2
- CUDA: launch_bounds, small q4_K, q5_K mmq refactor (#2596)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
CUDA: launch_bounds, small q4_K, q5_K mmq refactor (#2596)
References
#2596 - CUDA: Add launch bounds for Pascal, small q4_K, q5_K refactor
Author
JohannesGaessler
Parents
2feb8934
Loading