llama.cpp
CUDA: skip fully masked-out KV in FA vec kernel
#13584
Merged

JohannesGaessler CUDA: skip fully masked-out KV in FA vec kernel
98543709
github-actions added Nvidia GPU
github-actions added ggml
JohannesGaessler fix AMD compilation
69647be0
ggerganov approved these changes on 2025-05-20
JohannesGaessler merged b69f1647 into master 252 days ago
ggerganov commented on 2025-05-24
ggerganov commented on 2025-05-24

