llama.cpp
5764d7c6 - gemma : perform per-layer projections in the first layer (#21612)

Commit
3 days ago
gemma : perform per-layer projections in the first layer (#21612) * gemma : reduce graph splits by keeping per-layer ops in the input layer * gemma : put the per-layer proj in the first layer * cont : move the projection before the layer loop
Author
Parents
Loading