llama.cpp
CUDA: fix overflow in FA, tune performance
#14840
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
1
Changes
View On
GitHub
CUDA: fix overflow in FA, tune performance
#14840
JohannesGaessler
merged 1 commit into
ggml-org:master
from
JohannesGaessler:cuda-fa-fix-overflow-2
github-actions
added
Nvidia GPU
github-actions
added
ggml
slaren
approved these changes on 2025-07-23
ggerganov
commented on 2025-07-23
CUDA: fix overflow in FA, tune performance
d4209ee4
JohannesGaessler
force pushed
from
ec05b081
to
d4209ee4
48 days ago
ggerganov
approved these changes on 2025-07-23
JohannesGaessler
merged
a86f52b2
into master
48 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
ggerganov
slaren
Assignees
No one assigned
Labels
Nvidia GPU
ggml
Milestone
No milestone
Login to write a write a comment.
Login via GitHub