llama.cpp
CUDA: revise q8_1 data layout for mul_mat_q
#7824
Merged

CUDA: revise q8_1 data layout for mul_mat_q #7824

JohannesGaessler
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
github-actions
slaren
slaren approved these changes on 2024-06-08
JohannesGaessler CUDA: revise q8_1 data layout for mul_mat_q
05a5fa08
JohannesGaessler JohannesGaessler force pushed to 05a5fa08 1 year ago
JohannesGaessler JohannesGaessler merged 42b53d19 into master 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone