llama.cpp
opencl: add optimized q4_1 mm kernel for adreno
#19840
Merged

opencl: add optimized q4_1 mm kernel for adreno #19840

shaofeiqi
github-actions github-actions added ggml
github-actions github-actions added OpenCL
shaofeiqi Add Q4_1 OpenCL Kernels
66297b4b
lhez opencl: refactor transpose
d0000b29
lhez opencl: format
f53e7bcb
lhez opencl: refactor q4_1 unpack
a116dbac
lhez opencl: move `ggml_cl_mul_mat_q4_1_f32_adreno`
df29f27e
lhez opencl: refactor `ggml_cl_mul_mat_q4_1_f32_adreno` and kernels
7fb70f94
lhez opencl: rename kernel files and kernes
dddf931a
lhez opencl: fix build for non adreno
8ef5b83c
lhez lhez force pushed from 9dbdb499 to 8ef5b83c 5 days ago
lhez opencl: move code around and format
a24cc95e
lhez
lhez lhez marked this pull request as ready for review 4 days ago
lhez lhez requested a review from lhez lhez 4 days ago
lhez lhez requested a review from max-krasnyansky max-krasnyansky 4 days ago
lhez
lhez approved these changes on 2026-03-03
lhez lhez merged 24350fdf into master 1 day ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone