llama.cpp
ggml: allow prefetching tensor overrides
#21067
Open

ggml: allow prefetching tensor overrides #21067

am17an wants to merge 4 commits into ggml-org:master from am17an:bunch-moe-transfer
am17an
am17an ggml-backend: prefetch weights async
dd6d1e8a
github-actions github-actions added Nvidia GPU
github-actions github-actions added examples
github-actions github-actions added ggml
github-actions github-actions added SYCL
github-actions github-actions added Ascend NPU
github-actions github-actions added OpenCL
github-actions github-actions added IBM zDNN
github-actions github-actions added OpenVINO
github-actions github-actions added WebGPU
am17an simplify
e62a3e40
am17an add dedicated events
8f48f029
am17an copy_stream false to other backends
4d78c9d1
am17an
am17an am17an requested a review from JohannesGaessler JohannesGaessler 4 days ago
am17an am17an requested a review from ggerganov ggerganov 4 days ago
github-actions github-actions added Vulkan
github-actions github-actions added Apple Metal
github-actions github-actions added Hexagon
JohannesGaessler

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone