llama.cpp
gemma : use more bits for the token_embd.weight tensor
#5650

Merged

gemma : use more bits for the token_embd.weight tensor #5650

ggerganov merged 2 commits into master from gg/improve-gemma-quants

gemma : use Q8_0 for the token_embd.weight tensor

f181e601

llama : quantize token_embd.weight using output type

488bd973

ggerganov changed the title ~~gemma : use Q8_0 for the token_embd.weight tensor~~ gemma : use more bits for the token_embd.weight tensor 2 years ago

ggerganov merged 96633eec into master 2 years ago

ggerganov deleted the gg/improve-gemma-quants branch 2 years ago

Reviewers

No reviews

Assignees

No one assigned

Labels

None yet

Milestone

No milestone

llama.cpp gemma : use more bits for the token_embd.weight tensor #5650 Merged

gemma : use more bits for the token_embd.weight tensor #5650

llama.cpp
gemma : use more bits for the token_embd.weight tensor
#5650

Merged