llama.cpp
308f61c3
- opencl: improve get_rows, cpy, concat and q6_k flat gemv (#24160)
Commit
3 days ago
opencl: improve get_rows, cpy, concat and q6_k flat gemv (#24160)

* opencl: allow multiple workgroups for large rows
* opencl: improve small cpy
* opencl: packed concat for small input
* opencl: tweak flat q6_K gemv, increase N_DST and remap threads
References
#24160 - opencl: improve get_rows, cpy, concat and q6_k flat gemv
Author
lhez
Parents
da87e9b6