llama.cpp
fdee152e - starcoder : add GPU offloading (#3827)

Commit
1 year ago
starcoder : add GPU offloading (#3827) * starcoder : do not GPU split 1D bias tensors * starcoder : offload layers to GPU ggml-ci
Author
Parents
  • File
    llama.cpp
Loading