llama.cpp
opencl: add kernel to handle mat mul in attention to improve encoding speed
#17181
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
7
Changes
View On
GitHub
Commits
Add mul_mm_f16_f32_kq_kqv kernel
shaofeiqi
committed
47 days ago
Add ggml_cl_mul_mat_kq_kqv_adreno func
shaofeiqi
committed
47 days ago
fix whitespace
shaofeiqi
committed
47 days ago
remove unused variable
shaofeiqi
committed
47 days ago
remove redundant
shaofeiqi
committed
47 days ago
refactor and clean up
shaofeiqi
committed
43 days ago
remove trailing whitespace
shaofeiqi
committed
41 days ago
Loading