llama.cpp
1cd06fa2 - CUDA: launch_bounds, small q4_K, q5_K mmq refactor (#2596)

Commit

2 years ago

CUDA: launch_bounds, small q4_K, q5_K mmq refactor (#2596)

References

#2596 - CUDA: Add launch bounds for Pascal, small q4_K, q5_K refactor

Author

JohannesGaessler

JohannesGaessler

Parents

Loading