llama.cpp
CUDA: fastdiv, launch bounds for mmvq + q8_1 quant
#15802
Merged

CUDA: fastdiv, launch bounds for mmvq + q8_1 quant #15802

JohannesGaessler
JohannesGaessler CUDA: fastdiv, launch bounds for mmvq + q8_1 quant
dbde65ff
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
ORippler
ORippler commented on 2025-09-05
JohannesGaessler use fastfdiv for mul_mat_id modulo
748c6a53
JohannesGaessler
ORippler
ORippler approved these changes on 2025-09-05
slaren
slaren approved these changes on 2025-09-05
JohannesGaessler JohannesGaessler merged 5143fa89 into master 62 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone