llama.cpp
ba69bbc8 - imatrix : offload to GPU support (#4957)

Commit

2 years ago

imatrix : offload to GPU support (#4957)

* backend : add eval callback ggml-ci
* backend : group nodes in a single compute when user don't need them
* backend : clean-up the implementation ggml-ci
* simple : do not perform tensor data copy if not needed
* simple : fix
* imatrix : offload to GPU support
* imatrix : fix ggml_mul_mat_id handling ggml-ci
* ci : add imatrix test ggml-ci
* ci : rearrange output ggml-ci

References

#4957 - imatrix : offload to GPU support

Author

ggerganov

Parents

44a1a4a4
