llama.cpp
metal : make the backend async
#15832
Closed

ggerganov wants to merge 9 commits into master from gg/metal-async
github-actions added the ggml label
github-actions added the Apple Metal label
ggerganov force pushed from 2b8074d9 to 84ae8368 2 days ago
ggerganov force pushed from 84ae8368 to c4fe8b0f 2 days ago
ggerganov marked this pull request as ready for review 2 days ago
ggerganov commented on 2025-09-08
ggerganov metal : make the backend async
97b96c1a
ggerganov cont : add comments, extend op offload, clean up
c5637cf3
ggerganov metal : fix batch size for MUL_MAT_ID
bdff7729
ggerganov metal : remove deprecated ggml_backend_metal_buffer_from_ptr
d91ba85d
ggerganov force pushed from 1526218e to d91ba85d 1 day ago
ggerganov force pushed from f369bdb7 to 8d4835d3 1 day ago
ggerganov commented on 2025-09-09
ggerganov metal : create only metal buffers, no wrapping of host memory
85aaf52b
ggerganov force pushed from 8d4835d3 to 85aaf52b 1 day ago
ggerganov commented on 2025-09-09
ggerganov commented on 2025-09-09
ggerganov metal : restore .alloc_buffer for buffer_from_ptr_type
7fc2b3d5
ggerganov metal : remove broken implementation of GGML_OP_SET
f288225d
ggerganov metal : clean-up loose ends, ready for tests
0926cb49
ggerganov metal : back to a single queue per device
3f62ee8b
slaren commented on 2025-09-09
ggerganov closed this 7 hours ago
