llama.cpp
opencl: improve get_rows, cpy, concat and q6_k flat gemv
#24160
Merged

opencl: improve get_rows, cpy, concat and q6_k flat gemv #24160

lhez
lhez opencl: allow multiple workgroups for large rows
bfedc327
lhez opencl: improve small cpy
6da0c8eb
lhez opencl: packed concat for small input
3774105b
lhez opencl: tweak flat q6_K gemv, increase N_DST and remap threads
0fb3d35a
github-actions github-actions added ggml
github-actions github-actions added OpenCL
lhez lhez marked this pull request as ready for review 3 days ago
lhez lhez requested a review 3 days ago
max-krasnyansky
max-krasnyansky approved these changes on 2026-06-05
lhez
CISC
CISC approved these changes on 2026-06-05
lhez lhez merged 308f61c3 into master 2 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone