llama.cpp
Possible solution to allow K-quants on models with n_vocab!=32000
#2148
Merged

Possible solution to allow K-quants on models with n_vocab!=32000 #2148

LostRuins
LostRuins This allows LLAMA models that were previously incompatible with K qua…
18541688
LostRuins LostRuins marked this pull request as ready for review 2 years ago
JohannesGaessler
LostRuins
ikawrakow
TheBloke
ggerganov
ggerganov approved these changes on 2023-07-09
jxy
LostRuins Fix indentation
048dca98
LostRuins As an alternative, to avoid failing on Metal due to lack of Q8_0 supp…
fd9a2fdf
TheBloke
LostRuins
LostRuins
LostRuins LostRuins merged bbef2821 into master 2 years ago
LostRuins LostRuins deleted the kquant_vocab_fix branch 2 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone