llama.cpp
ggml : PoC for normalizing weights for better quantization packing
#2434
Open

ggerganov wants to merge 3 commits into master from norm-quants
ggerganov added the demo label
ggerganov force pushed from 0dfbd1bd to a4d1eb72 2 years ago
ikawrakow
JohannesGaessler
JohannesGaessler
ggerganov
ikawrakow
ikawrakow
ggerganov
JohannesGaessler
klosax
JohannesGaessler
klosax
JohannesGaessler
philpax
klosax
LostRuins
ggerganov ggml : poc for normalizing weights for better quantization (metal)
253eab8a
ggerganov force pushed from dead8f4b to 253eab8a 2 years ago
ggerganov ggml : use less ggml_mul tasks when src0 rows are few
df54d2f1
cebtenzzre commented on 2023-08-30
ggerganov cuda : poc for norm quants (only -b 1 works)
8c2b8812
KerfuffleV2
