llama.cpp
CUDA: revise q8_1 data layout for mul_mat_q
#7824
Merged
JohannesGaessler merged 1 commit into ggml-org:master from JohannesGaessler:cuda-mmq-q8_1-2
github-actions added Nvidia GPU
github-actions added ggml
slaren approved these changes on 2024-06-08
CUDA: revise q8_1 data layout for mul_mat_q (05a5fa08)
JohannesGaessler force pushed to 05a5fa08 1 year ago
JohannesGaessler merged 42b53d19 into master 1 year ago
Reviewers: slaren
Assignees: No one assigned
Labels: Nvidia GPU, ggml
Milestone: No milestone