llama.cpp
1cd06fa2 - CUDA: launch_bounds, small q4_K, q5_K mmq refactor (#2596)

Commit
2 years ago
CUDA: launch_bounds, small q4_K, q5_K mmq refactor (#2596)
Parents
Loading