llama.cpp
llama : fix Gemma-2 Query scaling factors
#8473
Merged

ggerganov merged 2 commits into master from gg/gemma-2-fix-q-scale
ggerganov
danielhanchen 9B - query_pre_attn_scalar = 256 not 224
df78f196
ggerganov llama : fix Gemma-2 Query scaling factor
acc877f4
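
The first commit message states only the corrected value (256 rather than 224 for the 9B model). As a rough illustration of why the number matters, the sketch below assumes the scalar enters attention as a `1 / sqrt(query_pre_attn_scalar)` factor applied to the queries, as in the Gemma-2 reference configuration. That formula and the helper name `query_scale` are assumptions for illustration. They are not taken from this page or from the llama.cpp diff.

```python
import math


def query_scale(query_pre_attn_scalar: float) -> float:
    """Hypothetical helper: factor applied to Q before the Q.K^T product.

    Assumes the Gemma-2 convention scale = 1 / sqrt(query_pre_attn_scalar).
    """
    return 1.0 / math.sqrt(query_pre_attn_scalar)


# 9B values quoted in the commit message: 256 is correct, 224 is not.
scale_fixed = query_scale(256)
scale_wrong = query_scale(224)

# Relative size of the corrected scale versus the wrong one: sqrt(224 / 256).
ratio = scale_fixed / scale_wrong

print(f"scale with 256: {scale_fixed}")
print(f"scale with 224: {scale_wrong:.6f}")
print(f"fixed / wrong:  {ratio:.4f}")
```

Under this assumption, the wrong value makes every query vector, and therefore every attention logit before the softmax, about 7% too large.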
github-actions added python
mofosyne added Review Complexity : Low
ggerganov merged 73cf442e into master 1 year ago
ggerganov deleted the gg/gemma-2-fix-q-scale branch 1 year ago
danielhanchen

Reviewers
No reviews
Assignees
No one assigned