llama.cpp
CUDA: tune GLM 4.7 Flash FA kernel selection logic
#19097
Merged

CUDA: tune GLM 4.7 Flash FA kernel selection logic #19097

JohannesGaessler
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
ggerganov
ggerganov approved these changes on 2026-01-26
jacekpoplawski
jacekpoplawski
JohannesGaessler
jacekpoplawski
JohannesGaessler CUDA: tune GLM 4.7 Flash FA kernel selection logic
bee6c679
JohannesGaessler JohannesGaessler force pushed from 134adcac to bee6c679 132 days ago
JohannesGaessler
jacekpoplawski
JohannesGaessler JohannesGaessler merged a5bb8ba4 into master 131 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone