starcoder : add GPU offloading #3827
starcoder : do not GPU split 1D bias tensors
53ab0535
starcoder : offload layers to GPU
731dd98b
ggerganov
merged
fdee152e
into master 1 year ago
ggerganov
deleted the starcoder-cuda branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub