llama.cpp
starcoder : add GPU offloading
#3827
Merged

starcoder : add GPU offloading #3827

ggerganov merged 2 commits into master from starcoder-cuda
ggerganov
ggerganov starcoder : do not GPU split 1D bias tensors
53ab0535
ggerganov starcoder : offload layers to GPU
731dd98b
ggerganov ggerganov merged fdee152e into master 1 year ago
ggerganov ggerganov deleted the starcoder-cuda branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone