llama.cpp
CUDA: limit number of FA stream-k CUDA blocks
#20586
Merged

CUDA: limit number of FA stream-k CUDA blocks #20586

JohannesGaessler
JohannesGaessler CUDA: limit number of FA stream-k CUDA blocks
cc1232a4
am17an
am17an approved these changes on 2026-03-15
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
JohannesGaessler JohannesGaessler merged ae40cd27 into master 62 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone