llama.cpp
CUDA: fix FA VKQ accumulator overflow
#17746
Merged

CUDA: fix FA VKQ accumulator overflow #17746

JohannesGaessler
gabe-l-hart
gabe-l-hart
JohannesGaessler
gabe-l-hart
CISC
CISC approved these changes on 2025-12-03
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
ggerganov
ggerganov approved these changes on 2025-12-04
JohannesGaessler
ggerganov
JohannesGaessler CUDA: fix FA VKQ accumulator overflow
59e6cba1
JohannesGaessler JohannesGaessler force pushed from 1dd12722 to 59e6cba1 16 days ago
JohannesGaessler
JohannesGaessler JohannesGaessler merged e95d0bc8 into master 15 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone