llama.cpp
CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16
#15131
Merged

JohannesGaessler
JohannesGaessler CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16
e70fa55f
github-actions added Nvidia GPU
github-actions added ggml
slaren approved these changes on 2025-08-06
JohannesGaessler force pushed 5 times 316 days ago
JohannesGaessler try CI fix
52d9ccce
JohannesGaessler force pushed to 52d9ccce 316 days ago
JohannesGaessler merged 1d72c841 into master 316 days ago