llama.cpp
Possible solution to allow K-quants on models with n_vocab!=32000
#2148

Merged

Possible solution to allow K-quants on models with n_vocab!=32000 #2148

LostRuins merged 3 commits into ggml-org:master from LostRuins:kquant_vocab_fix

This allows LLAMA models that were previously incompatible with K qua…

18541688

LostRuins marked this pull request as ready for review 2 years ago

ggerganov approved these changes on 2023-07-09

Fix indentation

048dca98

As an alternative, to avoid failing on Metal due to lack of Q8_0 supp…

fd9a2fdf

LostRuins merged bbef2821 into master 2 years ago

LostRuins deleted the kquant_vocab_fix branch 2 years ago

Reviewers

ggerganov

Assignees

No one assigned

Labels

None yet

Milestone

No milestone