llama.cpp
Slight quantization improvement for Q4_K and Q5_K
#5361
Merged

Slight quantization improvement for Q4_K and Q5_K #5361

ikawrakow merged 2 commits into master from ik/q4k_tuning
ikawrakow
Q4_K: slightly better quantization
f58d49e5
Q5_K: slightly better quantization
d3cc1533
ggerganov
ggerganov approved these changes on 2024-02-06
BarfingLemurs
ikawrakow
ikawrakow ikawrakow merged f57fadc0 into master 1 year ago
ikawrakow ikawrakow deleted the ik/q4k_tuning branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone