llama.cpp
ggml: allow prefetching tensor overrides
#21067
Open
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
4
Changes
View On
GitHub
ggml: allow prefetching tensor overrides
#21067
am17an
wants to merge 4 commits into
ggml-org:master
from
am17an:bunch-moe-transfer
ggml-backend: prefetch weights async
dd6d1e8a
github-actions
added
Nvidia GPU
github-actions
added
examples
github-actions
added
ggml
github-actions
added
SYCL
github-actions
added
Ascend NPU
github-actions
added
OpenCL
github-actions
added
IBM zDNN
github-actions
added
OpenVINO
github-actions
added
WebGPU
simplify
e62a3e40
add dedicated events
8f48f029
copy_stream false to other backends
4d78c9d1
am17an
requested a review
from
JohannesGaessler
4 days ago
am17an
requested a review
from
ggerganov
4 days ago
github-actions
added
Vulkan
github-actions
added
Apple Metal
github-actions
added
Hexagon
Login to write a write a comment.
Login via GitHub
Reviewers
JohannesGaessler
ggerganov
Assignees
No one assigned
Labels
Nvidia GPU
Vulkan
examples
ggml
SYCL
Apple Metal
Ascend NPU
OpenCL
IBM zDNN
Hexagon
OpenVINO
WebGPU
Milestone
No milestone
Login to write a write a comment.
Login via GitHub