llama.cpp
CUDA: mul_mat_v support for batch sizes > 1
#14262
Merged

CUDA: mul_mat_v support for batch sizes > 1 #14262

JohannesGaessler
JohannesGaessler CUDA: mul_mat_v support for batch sizes > 1
e22e345d
IMbackK
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
JohannesGaessler
slaren
JohannesGaessler use 64 bit math for initial offset calculation
2d24a9ce
JohannesGaessler
slaren
JohannesGaessler fix mul_mat_id
60ce04c3
JohannesGaessler
yeahdongcn
IMbackK
yeahdongcn
yeahdongcn
JohannesGaessler
yeahdongcn
JohannesGaessler
IMbackK
IMbackK
IMbackK approved these changes on 2025-06-22
IMbackK
JohannesGaessler JohannesGaessler merged defe2158 into master 93 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone