llama.cpp
opencl: add optimized q8_0 mm kernel for adreno
#18871
Merged


shaofeiqi
shaofeiqi requested a review from lhez 153 days ago
shaofeiqi requested a review from max-krasnyansky 153 days ago
github-actions added the ggml label
github-actions added the OpenCL label
lhez marked this pull request as draft 153 days ago
lhez force pushed from b1b5284b to bcacdc32 144 days ago
lhez force pushed from bcacdc32 to 061bbff4 144 days ago
shaofeiqi Add Q8_0 OpenCL kernel
5b320b83
lhez opencl: fix build for non-adreno
8e968277
lhez opencl: refactor q8_0
e15ba0e2
lhez opencl: enforce subgroup size of 64 for adreno for q8_0
7ae603ed
lhez force pushed from 061bbff4 to 7ae603ed 141 days ago
lhez marked this pull request as ready for review 139 days ago
lhez opencl: suppress warning when adreno kernels are disabled
791524c4
lhez approved these changes on 2026-01-30
lhez merged 971facc3 into master 138 days ago

