gemma : perform per-layer projections in the first layer #21612
ngxson approved these changes on 2026-04-08
gemma : reduce graph splits by keeping per-layer ops in the input layer
9787dd2b
gemma : put the per-layer proj in the first layer
c8752f83
cont : move the projection before the layer loop
f4663616
ggerganov force pushed from 3b29d142 to f4663616 7 days ago
ngxson approved these changes on 2026-04-08
ggerganov merged 5764d7c6 into master 7 days ago
ggerganov deleted the gg/models-per-layer-fixes-2 branch 7 days ago