llama.cpp
63e66fdd
- opencl: use flat variants of q4_K and q6_K gemv for very large M (#24006)
Commit
2 days ago
opencl: use flat variants of q4_K and q6_K gemv for very large M (#24006)
References
#24006 - opencl: use flat variants of gemv for very large M
Author
lhez
Parents
5c394fdc