llama.cpp
gemma : more consistent attention scaling for v2 and v3
#13951
Merged

gemma : more consistent attention scaling for v2 and v3 #13951

ggerganov merged 3 commits into master from gg/gemma-fix-attn-scale
ggerganov
ggerganov gemma : fix attn scale for 27B
36469ad8
ggerganov cont : apply scale before attn
67c4346e
ggerganov ggerganov marked this pull request as draft 103 days ago
ggerganov cont : consistent attention scaling
fbc6df02
ggerganov ggerganov changed the title gemma : fix attn scale for 27B gemma : more consistent attention scaling for v2 and v3 102 days ago
ggerganov ggerganov marked this pull request as ready for review 102 days ago
ggerganov ggerganov merged 5582c49c into master 102 days ago
ggerganov ggerganov deleted the gg/gemma-fix-attn-scale branch 102 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone