llama.cpp
CUDA: refactor and deduplicate vector FA kernels
#16208
Merged

CUDA: refactor and deduplicate vector FA kernels #16208

JohannesGaessler
JohannesGaessler CUDA: refactor and deduplicate vector FA kernels
e2679030
JohannesGaessler JohannesGaessler requested a review from slaren slaren 116 days ago
github-actions github-actions added Nvidia GPU
github-actions github-actions added python
github-actions github-actions added ggml
JohannesGaessler
JohannesGaessler
JohannesGaessler fix kernel selection logic
8ba0ff79
JohannesGaessler JohannesGaessler force pushed to 8ba0ff79 115 days ago
slaren
slaren approved these changes on 2025-09-27
JohannesGaessler JohannesGaessler merged 75a3a6c2 into master 112 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone