llama.cpp
CUDA: mul_mat_vec_q for batch sizes > 1
#5351
Merged

CUDA: mul_mat_vec_q for batch sizes > 1 #5351

JohannesGaessler
JohannesGaessler CUDA: mul_mat_vec_q for batch sizes > 1
dbb795b9
JohannesGaessler JohannesGaessler force pushed to dbb795b9 2 years ago
ggerganov
ggerganov ggerganov requested a review from slaren slaren 2 years ago
JohannesGaessler
slaren
slaren approved these changes on 2024-02-06
JohannesGaessler JohannesGaessler merged 2c516611 into master 2 years ago
ggerganov
slaren
ggerganov
slaren
JohannesGaessler
slaren
ggerganov
JohannesGaessler
JohannesGaessler
ggerganov

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone