llama.cpp
llama : make quantize example up to 2.7x faster
#3115
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
3
Changes
View On
GitHub
Commits
llama : refactor k-quant mixture logic into a function
cebtenzzre
committed
2 years ago
llama : optimize vector use in quantize -> 179% faster
cebtenzzre
committed
2 years ago
llama : don't zero-init vectors in quantize -> 5.1% faster
cebtenzzre
committed
2 years ago
Loading