llama.cpp
ggml : use 8-bit precision for Q4_1 intermediate results
#1047
Merged

ggml : use 8-bit precision for Q4_1 intermediate results #1047

ggerganov merged 4 commits into master from q4_1xq8_0
ggerganov
ggerganov ggerganov added high priority
ggerganov ggerganov added generation quality
dfyz
dfyz
ggerganov
dfyz
ggerganov ggml : use 8-bit precision for Q4_1 intermediate results (ARM)
e9c07f72
ggerganov ggml : optimize ggml_vec_dot_q4_1_q8_0() via vmalq_n_f32
42623052
slaren ggml : AVX2 implementation of ggml_vec_dot_q4_1_q8_0 (#1051)
ad7007aa
ggerganov gitignore : ignore ppl-*.txt files
e582f2ad
ggerganov ggerganov force pushed to e582f2ad 2 years ago
ggerganov
ggerganov ggerganov merged 884e7d7a into master 2 years ago
ggerganov ggerganov deleted the q4_1xq8_0 branch 2 years ago
teaalltr
ggerganov ggerganov assigned ggerganov ggerganov 2 years ago
ggerganov ggerganov assigned slaren slaren 2 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
Labels
Milestone