llama.cpp
CUDA: faster dequantize kernels for Q4_0 and Q4_1
#4938
Merged

CUDA: faster dequantize kernels for Q4_0 and Q4_1 #4938

ikawrakow
CUDA: faster dequantize kernels for Q4_0 and Q4_1
08b89f7e
JohannesGaessler
JohannesGaessler commented on 2024-01-14
JohannesGaessler
JohannesGaessler approved these changes on 2024-01-14
JohannesGaessler
kalomaze
ikawrakow ikawrakow merged 4a3156de into master 2 years ago
ikawrakow ikawrakow deleted the ik/cuda_faster_legacy_dequantize branch 2 years ago
JohannesGaessler
ggerganov
ggerganov commented on 2024-01-15
ikawrakow
JohannesGaessler

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone