gemma : use more bits for the token_embd.weight tensor #5650
gemma : use Q8_0 for the token_embd.weight tensor
f181e601
llama : quantize token_embd.weight using output type
488bd973
ggerganov
changed the title gemma : use Q8_0 for the token_embd.weight tensor gemma : use more bits for the token_embd.weight tensor 1 year ago
ggerganov
merged
96633eec
into master 1 year ago
ggerganov
deleted the gg/improve-gemma-quants branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub