opencl: improve get_rows, cpy, concat and q6_k flat gemv #24160
opencl: allow multiple workgroups for large rows
bfedc327
opencl: improve small cpy
6da0c8eb
opencl: packed concat for small input
3774105b
opencl: tweak flat q6_K gemv, increase N_DST and remap threads
0fb3d35a
lhez
marked this pull request as ready for review 3 days ago
lhez
requested a review
3 days ago
CISC
approved these changes
on 2026-06-05
lhez
merged
308f61c3
into master 2 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub