PR #15131 CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16

JohannesGaessler merged 2 commits into ggml-org:master from JohannesGaessler:cuda-mmf-3

CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16

e70fa55f

github-actions added Nvidia GPU

github-actions added ggml

slaren approved these changes on 2025-08-06

JohannesGaessler force pushed 316 days ago

try CI fix

52d9ccce

JohannesGaessler force pushed to 52d9ccce 316 days ago

JohannesGaessler merged 1d72c841 into master 316 days ago

Reviewers

slaren

Labels

Nvidia GPU, ggml
