llama.cpp
3d3e6bd0 - llama : offload for rest of the model arches

Commit
2 years ago
llama : offload for rest of the model arches
Author
Parents
Loading