llama-cpp-python
perf: vectorize KV cache prefix matching with numpy
#2179
Open
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
2
Changes
View On
GitHub
perf: vectorize KV cache prefix matching with numpy
#2179
nausicaalii
wants to merge 2 commits into
abetlen:main
from
nausicaalii:perf/vectorize-prefix-match
perf: vectorize prefix matching with numpy
d815bba0
refactor: deduplicate prefix matching and eliminate .tolist() overhead
aeb7d7cf
Login to write a write a comment.
Login via GitHub
Reviewers
No reviews
Assignees
No one assigned
Labels
None yet
Milestone
No milestone
Login to write a write a comment.
Login via GitHub