llama.cpp
CUDA: skip masked KV slices for all FA kernels
#14924
Merged

CUDA: skip masked KV slices for all FA kernels #14924

JohannesGaessler
JohannesGaessler CUDA: skip masked KV slices for all FA kernels
c113ed79
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
ggerganov
am17an
JohannesGaessler
JohannesGaessler
JohannesGaessler
am17an
JohannesGaessler
ggerganov
JohannesGaessler
JohannesGaessler
ggerganov
ggerganov
JohannesGaessler
ggerganov
ggerganov
JohannesGaessler
JohannesGaessler
JohannesGaessler
ggerganov
JohannesGaessler
ggerganov
ggerganov approved these changes on 2025-07-30
JohannesGaessler JohannesGaessler merged 92b8810e into master 228 days ago
JohannesGaessler
ggerganov
JohannesGaessler
ggerganov
JohannesGaessler
ggerganov

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone