llama.cpp
ggml : PoC for normalizing weights for better quantization packing
#2434
Open
Commits (3)
ggml : poc for normalizing weights for better quantization (metal)
ggerganov committed 2 years ago
ggml : use less ggml_mul tasks when src0 rows are few
ggerganov committed 2 years ago
cuda : poc for norm quants (only -b 1 works)
ggerganov committed 2 years ago