CUDA: fastdiv, launch bounds for mmvq + q8_1 quant #15802
CUDA: fastdiv, launch bounds for mmvq + q8_1 quant
dbde65ff
use fastfdiv for mul_mat_id modulo
748c6a53
ORippler
approved these changes
on 2025-09-05
slaren
approved these changes
on 2025-09-05
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub