llama.cpp
CUDA: mul_mat_vec_q max. batch size 8 -> 4
#5370
Merged

CUDA: mul_mat_vec_q max. batch size 8 -> 4 #5370

JohannesGaessler
JohannesGaessler CUDA: mul_mat_vec_q max. batch size 8 -> 4
4e1d68b3
ggerganov
ggerganov approved these changes on 2024-02-06
ggerganov ggerganov merged 17c97fb0 into master 2 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone