llama.cpp
ggml: add env var GGML_OP_OFFLOAD_MIN_BATCH
#18535
Merged

DocShotgun
DocShotgun ggml: add env var GGML_OP_OFFLOAD_MIN_BATCH
3c1bcf26
DocShotgun requested a review from 0cc4m 34 days ago
DocShotgun requested a review from ggerganov 34 days ago
github-actions added Nvidia GPU
github-actions added Vulkan
github-actions added ggml
github-actions added SYCL
github-actions added Apple Metal
github-actions added Ascend NPU
ggerganov commented on 2026-01-02
DocShotgun ggml: read GGML_OP_OFFLOAD_MIN_BATCH once and store to dev ctx
fa467740
DocShotgun cann: forward declaration of device context struct
a449358f
DocShotgun cann: move offload op check after device context declaration
7a838e78
DocShotgun
NeoZhangJianyu commented on 2026-01-04
0cc4m
am17an approved these changes on 2026-01-06
DocShotgun cuda: fix whitespace
919aa4f9
am17an
ggerganov merged 9a5724de into master 28 days ago
