opencl: add kernel to handle mat mul in attention to improve encoding speed #17181
Add mul_mm_f16_f32_kq_kqv kernel
9e5c5960
Add ggml_cl_mul_mat_kq_kqv_adreno func
24f32df4
fix whitespace
dada5171
remove unused variable
0fc4b8bd
remove redundant
301662b2
refactor and clean up
41bf54f8
remove trailing whitespace
b3ee2ab0