llama.cpp
CUDA: tuned mul_mat_q kernels
#2546
Merged
JohannesGaessler merged 1 commit into ggml-org:master from JohannesGaessler:cuda-faster-mmq-7
CUDA: tuned mul_mat_q kernels (ca32203c)
JohannesGaessler force pushed to ca32203c 2 years ago
slaren approved these changes on 2023-08-08
JohannesGaessler merged 25d43e0e into master 2 years ago
SlyEcho commented on 2023-08-09
Reviewers: slaren, SlyEcho
Assignees: No one assigned
Labels: None yet
Milestone: No milestone