llama.cpp
gemma : use more bits for the token_embd.weight tensor
#5650
Merged

ggerganov merged 2 commits into master from gg/improve-gemma-quants
ggerganov
ggerganov gemma : use Q8_0 for the token_embd.weight tensor
f181e601
slaren
hannibalhuang
ggerganov llama : quantize token_embd.weight using output type
488bd973
ggerganov
jingnanzhou
ggerganov changed the title from "gemma : use Q8_0 for the token_embd.weight tensor" to "gemma : use more bits for the token_embd.weight tensor" 1 year ago
ggerganov merged 96633eec into master 1 year ago
ggerganov deleted the gg/improve-gemma-quants branch 1 year ago

Reviewers
No reviews
Assignees
No one assigned