llama.cpp
CUDA: batch out_prod inner loop with cublasSgemmStridedBatched
#22651
Merged

CUDA: batch out_prod inner loop with cublasSgemmStridedBatched #22651

leonardHONG
leonardHONG CUDA: batch out_prod inner loop with cublasSgemmStridedBatched
cbde47de
leonardHONG CUDA: batch out_prod inner loop with cublasSgemmStridedBatched
5a5bbe4d
leonardHONG leonardHONG requested a review from ggerganov ggerganov 39 days ago
leonardHONG leonardHONG requested a review 39 days ago
github-actions github-actions added testing
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
am17an
am17an approved these changes on 2026-05-04
JohannesGaessler
JohannesGaessler approved these changes on 2026-05-04
leonardHONG CUDA: add cublasSgemmStridedBatched mapping for HIP and MUSA backends
a6284c91
leonardHONG leonardHONG requested a review from IMbackK IMbackK 37 days ago
JohannesGaessler
JohannesGaessler approved these changes on 2026-05-07
JohannesGaessler JohannesGaessler merged 05ff59cb into master 35 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone