CUDA: batch out_prod inner loop with cublasSgemmStridedBatched #22651
CUDA: batch out_prod inner loop with cublasSgemmStridedBatched
cbde47de
CUDA: batch out_prod inner loop with cublasSgemmStridedBatched
5a5bbe4d
am17an
approved these changes
on 2026-05-04
CUDA: add cublasSgemmStridedBatched mapping for HIP and MUSA backends
a6284c91
Assignees
No one assigned
Labels
testing
Nvidia GPU
ggml
Login to write a write a comment.
Login via GitHub