llama.cpp
ggml : alternative Q4_3 implementation using modified Q8_0
#1109
Merged

ggml : alternative Q4_3 implementation using modified Q8_0 #1109

ggerganov merged 5 commits into master from q4_3b
ggerganov
ggerganov ggerganov marked this pull request as ready for review 2 years ago
ggerganov
sw
sw commented on 2023-04-21
ggerganov ggml : prefer vzip to vuzp
ec805eef
ggerganov ggml : alternative Q4_3 implementation using modified Q8_0
5425e060
ggerganov ggml : fix Q4_3 scalar imlpementation
829c4806
ggerganov ggml : slight improvement of Q4_3 - no need for loop unrolling
76b6b267
ggerganov ggerganov force pushed to 76b6b267 2 years ago
ggerganov ggml : fix AVX paths for Q8_0 quantization
2c358eca
ggerganov ggerganov merged 955ef9a5 into master 2 years ago
ggerganov ggerganov deleted the q4_3b branch 2 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone