llama.cpp
ggml-zendnn : adaptive fallback to CPU backend for small batch sizes
#22681
Merged

ggml-zendnn : adaptive fallback to CPU backend for small batch sizes #22681

z-sachin
z-sachin ggml-zendnn : add runtime env var GGML_ZENDNN_ADAPTIVE_FALLBACK to co…
a38a4ca6
ggml-gh-bot
github-actions github-actions added ggml
github-actions github-actions added AMD ZenDNN
taronaeo
z-vishal
z-sachin ggml-zendnn : restore original fallback logic when adaptive fallback …
199e8f29
z-vishal
z-vishal
z-vishal approved these changes on 2026-05-12
taronaeo
taronaeo approved these changes on 2026-05-12
taronaeo taronaeo added merge ready
taronaeo
am17an
am17an approved these changes on 2026-05-12
lhez
lhez approved these changes on 2026-05-12
ggerganov ggerganov merged 61af07c2 into master 35 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone