llama.cpp
CUDA: avoid mul + bias fusion when buffers are split
#16935
Merged

CUDA: avoid mul + bias fusion when buffers are split #16935

am17an merged 1 commit into ggml-org:master from am17an:cuda-fix-sm-row
am17an
am17an CUDA: avoid mul + bias fusion when doing fusion
1d3a6152
am17an am17an requested a review from slaren slaren 125 days ago
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
am17an am17an requested a review from JohannesGaessler JohannesGaessler 125 days ago
am17an am17an changed the title CUDA: avoid mul + bias fusion when doing fusion CUDA: avoid mul + bias fusion when buffers are split 125 days ago
JohannesGaessler
JohannesGaessler approved these changes on 2025-11-02
IMbackK
IMbackK approved these changes on 2025-11-02
am17an
IMbackK
JohannesGaessler
am17an
am17an
JohannesGaessler
am17an
slaren
am17an
JohannesGaessler
IMbackK
am17an
am17an am17an merged 2759ccdb into master 124 days ago
am17an am17an deleted the cuda-fix-sm-row branch 124 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone