Possible solution to allow K-quants on models with n_vocab!=32000 #2148
This allows LLAMA models that were previously incompatible with K qua…
18541688
LostRuins
marked this pull request as ready for review 2 years ago
ggerganov
approved these changes
on 2023-07-09
Fix indentation
048dca98
As an alternative, to avoid failing on Metal due to lack of Q8_0 supp…
fd9a2fdf
LostRuins
merged
bbef2821
into master 2 years ago
LostRuins
deleted the kquant_vocab_fix branch 2 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub