PR #14840 CUDA: fix overflow in FA, tune performance

CUDA: fix overflow in FA, tune performance #14840

JohannesGaessler merged 1 commit into ggml-org:master from JohannesGaessler:cuda-fa-fix-overflow-2

github-actions added Nvidia GPU

github-actions added ggml

slaren approved these changes on 2025-07-23

ggerganov commented on 2025-07-23

CUDA: fix overflow in FA, tune performance

d4209ee4

JohannesGaessler force pushed from ec05b081 to d4209ee4 48 days ago

ggerganov approved these changes on 2025-07-23

JohannesGaessler merged a86f52b2 into master 48 days ago

Reviewers

ggerganov

slaren

Assignees

No one assigned

Labels

Nvidia GPU ggml

Milestone

No milestone