PR #21939 llama-diffusion-cli: read n_ctx back after creating the llama_context so the CLI doesn't reject all input when -c is omitted
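
A minimal sketch of the failure mode the title describes, not the actual patch. It assumes that omitting -c leaves the requested context size at 0 (in llama.cpp's `llama_context_params`, `n_ctx = 0` means "take it from the model") and that the CLI compared the prompt length against that requested value instead of the effective one, which the real API exposes through `llama_n_ctx(ctx)`. Every name below (`mock_params`, `mock_context`, `mock_make_context`, `MOCK_MODEL_DEFAULT_CTX`, `prompt_fits_before`, `prompt_fits_after`) is a hypothetical stand-in, and the exact comparison used by the CLI is assumed.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-ins for the llama.cpp types; NOT the real API.
struct mock_params {
    uint32_t n_ctx = 0; // 0 = no -c given, "use the model's context size"
};

struct mock_context {
    uint32_t n_ctx; // effective context size after creation
};

// Assumed model default, chosen only for illustration.
constexpr uint32_t MOCK_MODEL_DEFAULT_CTX = 4096;

// Mimics context creation: a requested size of 0 resolves to the model default.
inline mock_context mock_make_context(const mock_params & p) {
    return mock_context{ p.n_ctx != 0 ? p.n_ctx : MOCK_MODEL_DEFAULT_CTX };
}

// Assumed pre-fix check: compares against the *requested* size, which is 0
// when -c is omitted, so no prompt can ever pass.
inline bool prompt_fits_before(const mock_params & p, uint32_t n_prompt) {
    return n_prompt < p.n_ctx;
}

// Post-fix check: compares against the size read back from the context
// (the role llama_n_ctx(ctx) plays in the real code).
inline bool prompt_fits_after(const mock_context & ctx, uint32_t n_prompt) {
    return n_prompt < ctx.n_ctx;
}

// Small walk-through of both checks with and without an explicit size.
inline void sketch_demo() {
    mock_params no_c;                        // -c omitted
    mock_context ctx = mock_make_context(no_c);
    assert(!prompt_fits_before(no_c, 16));   // rejected: 16 < 0 is false
    assert(prompt_fits_after(ctx, 16));      // accepted: 16 < 4096

    mock_params with_c;
    with_c.n_ctx = 512;                      // -c 512
    mock_context ctx2 = mock_make_context(with_c);
    assert(prompt_fits_after(ctx2, 16));
    assert(!prompt_fits_after(ctx2, 1024));  // still rejects oversize prompts
}
```

Reading the value back from the context keeps the length check valid whether or not the user passes -c, since the context is the single place where the 0 placeholder has already been resolved.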