llama.cpp
imatrix : offload to GPU support
#4957
Merged

imatrix : offload to GPU support #4957

ggerganov merged 10 commits into master from gg/imatrix-gpu-4931
ggerganov
ggerganov backend : add eval callback
65648b34
ggerganov backend : group nodes in a single compute when user don't need them
01b6f68a
ggerganov backend : clean-up the implementation
83f3d7a8
ggerganov simple : do not perform tensor data copy if not needed
e1b1db9f
ggerganov simple : fix
e0493800
ggerganov imatrix : offload to GPU support
0b2fca9a
ggerganov ggerganov requested a review from ikawrakow ikawrakow 1 year ago
ikawrakow
ikawrakow approved these changes on 2024-01-15
Artefact2
TheBloke
ikawrakow
TheBloke
kalomaze
askmyteapot
ggerganov
ikawrakow
ggerganov imatrix : fix ggml_mul_mat_id hanlding
a722d05a
ggerganov ci : add imatrix test
10b25e03
ggerganov
ggerganov ci : rearrange output
4fb52843
askmyteapot
JianbangZ
Base automatically changed from gg/sched-eval-callback-4931 to master 1 year ago
ggerganov Merge branch 'master' into gg/imatrix-gpu-4931
2917e6b5
ggerganov ggerganov merged ba69bbc8 into master 1 year ago
Mihaiii
Iridescent-gcrace

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone