llama.cpp
CUDA: fix FA FP16 accumulator overflow for Granite
#18614
Merged

JohannesGaessler
JohannesGaessler CUDA: fix FA FP16 accumulator overflow for Granite
1d875330
ggerganov
ggerganov approved these changes on 2026-01-05
github-actions added Nvidia GPU
github-actions added ggml
JohannesGaessler merged df17a4c9 into master 51 days ago
