opencl: add optimized q4_1 mm kernel for adreno #19840
Add Q4_1 OpenCL Kernels
66297b4b
opencl: refactor transpose
d0000b29
opencl: format
f53e7bcb
opencl: refactor q4_1 unpack
a116dbac
opencl: move `ggml_cl_mul_mat_q4_1_f32_adreno`
df29f27e
opencl: refactor `ggml_cl_mul_mat_q4_1_f32_adreno` and kernels
7fb70f94
opencl: rename kernel files and kernes
dddf931a
opencl: fix build for non adreno
8ef5b83c
lhez
force pushed
from
9dbdb499
to
8ef5b83c
5 days ago
opencl: move code around and format
a24cc95e
lhez
marked this pull request as ready for review 4 days ago
lhez
requested a review
from
lhez
4 days ago
lhez
approved these changes
on 2026-03-03
lhez
merged
24350fdf
into master 1 day ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub