llama.cpp
ggml-zendnn : adaptive fallback to CPU backend for small batch sizes
#22681

Merged

ggml-zendnn : adaptive fallback to CPU backend for small batch sizes #22681

ggerganov merged 2 commits into ggml-org:master from z-sachin:ggml-zendnn/adaptive-fallback-env

ggml-zendnn : add runtime env var GGML_ZENDNN_ADAPTIVE_FALLBACK to co…

a38a4ca6

github-actions added ggml

github-actions added AMD ZenDNN

ggml-zendnn : restore original fallback logic when adaptive fallback …

199e8f29

z-vishal approved these changes on 2026-05-12

taronaeo approved these changes on 2026-05-12

taronaeo added merge ready

am17an approved these changes on 2026-05-12

lhez approved these changes on 2026-05-12

ggerganov merged 61af07c2 into master 35 days ago

Reviewers

lhez

am17an

taronaeo

z-vishal

Assignees

No one assigned

Labels

ggml merge ready AMD ZenDNN

Milestone

No milestone