llama-cpp-python
fix: prevent KV cache corruption on SWA/ISWA models + hot-path perf
#2180
Open
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
1
Changes
View On
GitHub
fix: prevent KV cache corruption on SWA/ISWA models + hot-path perf
#2180
avion23
wants to merge 1 commit into
abetlen:main
from
avion23:fix/perf-and-iswa
avion23
force pushed
from
5935c64d
to
939fa72d
77 days ago
avion23
force pushed
from
939fa72d
to
9609c824
77 days ago
avion23
changed the title
perf: vectorize hot-path operations + fix SWA/ISWA KV cache corruption (Gemma-4)
fix: prevent KV cache corruption on SWA/ISWA models + hot-path perf
77 days ago
avion23
marked this pull request as ready for review
77 days ago
avion23
force pushed
from
c9bbd6d8
to
3538232d
77 days ago
perf: vectorize hot-path ops, reduce Python overhead, fix SWA/ISWA KV…
e54fe594
avion23
force pushed
from
3538232d
to
e54fe594
76 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
No reviews
Assignees
No one assigned
Labels
None yet
Milestone
No milestone
Login to write a write a comment.
Login via GitHub