llama-cpp-python
This PR implements the previously stubbed state management methods in the _internals.py module and updates the corresponding API calls in llama.py to use the correct underlying C++ function names.
#2134
Open

This PR implements the previously stubbed state management methods in the _internals.py module and updates the corresponding API calls in llama.py to use the correct underlying C++ function names. #2134

bsides230 wants to merge 6 commits into abetlen:main from bsides230:kv-caching-issue
bsides230
codavidgarcia feat: update llama.cpp submodule and bindings for Qwen 3.5 support
d21ef679
r-dh fix: set BUILD_NUMBER and LLAMA_INSTALL_VERSION for mtmd build
eacc2584
r-dh fix: return bool from kv_cache_seq_rm for partial removal detection
01248477
r-dh fix: handle GDN hybrid models that reject partial memory removal
47aedc22
bsides230 Update llama.cpp submodule to latest ggml-org
2ee9d3d3
bsides230 This PR implements the previously stubbed state management methods in…
1a4f7589

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone