llama.cpp
CUDA: only allocate FA tmp buffer if needed #18564
Merged

Commit 63043623 by JohannesGaessler: CUDA: only allocate FA tmp buffer if needed
github-actions added labels: Nvidia GPU, ggml
am17an approved these changes on 2026-01-03
JohannesGaessler merged 0f2e42ca into master 54 days ago
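
The PR title suggests that, before this change, the FlashAttention temporary buffer was allocated even when the kernel did not use it. Below is a minimal sketch of that idea, assuming the scratch space is only needed when multiple parallel blocks write partial results that a later combine pass merges; all names here (`FaTmpBuffer`, `maybe_alloc_fa_tmp`, `parallel_blocks`) are hypothetical and are not the actual ggml-cuda symbols.

```cpp
#include <cuda_runtime.h>
#include <cstddef>

struct FaTmpBuffer {
    void * ptr  = nullptr;
    size_t size = 0;
};

// Allocate the FA temp buffer only when >1 parallel blocks produce partial
// results that must be combined afterwards; with a single block the kernel
// can write its output directly and no scratch space is required.
static cudaError_t maybe_alloc_fa_tmp(FaTmpBuffer & buf, int parallel_blocks, size_t bytes_per_block) {
    if (parallel_blocks <= 1) {
        return cudaSuccess; // no combine pass, nothing to allocate
    }
    const size_t needed = (size_t) parallel_blocks * bytes_per_block;
    if (buf.ptr != nullptr && buf.size >= needed) {
        return cudaSuccess; // reuse the existing allocation
    }
    if (buf.ptr != nullptr) {
        cudaFree(buf.ptr);
        buf.ptr  = nullptr;
        buf.size = 0;
    }
    const cudaError_t err = cudaMalloc(&buf.ptr, needed);
    if (err == cudaSuccess) {
        buf.size = needed;
    }
    return err;
}
```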
