ggml : PoC for normalizing weights for better quantization packing #2434
ggerganov force pushed from 0dfbd1bd to a4d1eb72 2 years ago
ggml : poc for normalizing weights for better quantization (metal)
253eab8a
ggerganov force pushed from dead8f4b to 253eab8a 2 years ago
ggml : use less ggml_mul tasks when src0 rows are few
df54d2f1
cuda : poc for norm quants (only -b 1 works)
8c2b8812