llama.cpp
llama-diffusion-cli: read n_ctx back after making llama_context so the cli doesn't reject all input without -c
#21939
Merged

Loading