llama.cpp
gemma : allow offloading the output tensor
#5646
Merged

Loading