llama.cpp
5764d7c6
- gemma : perform per-layer projections in the first layer (#21612)
Commit
3 days ago
gemma : perform per-layer projections in the first layer (#21612)

* gemma : reduce graph splits by keeping per-layer ops in the input layer
* gemma : put the per-layer proj in the first layer
* cont : move the projection before the layer loop
References
#21612 - gemma : perform per-layer projections in the first layer
Author
ggerganov
Parents
87f4744a