llama.cpp
CUDA: fix FP16 overflow in tile FA kernel
#17875
Merged

CUDA: fix FP16 overflow in tile FA kernel #17875

JohannesGaessler
JohannesGaessler CUDA: fix FP16 overflow in tile FA kernel
cb9ac467
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
ggerganov
ggerganov approved these changes on 2025-12-09
JohannesGaessler JohannesGaessler merged 0cdce38a into master 24 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone