starcoder : add GPU offloading (#3827)

2 years ago

starcoder : add GPU offloading (#3827)

* starcoder : do not GPU split 1D bias tensors
* starcoder : offload layers to GPU

ggml-ci