llama.cpp
9ca79d5c - kv cache slot search improvements (#3493)

Commit

2 years ago

kv cache slot search improvements (#3493) * kv cache slot search improvements * Use n_ctx in kv find slot for consistency * Ensure kv cache head points to a valid slot in llama_decode internal * Add some comments to prevent dumb people (like me) from getting confused.

References

#3493 - kv cache slot search improvements

Author

KerfuffleV2

Parents

0c731ca4

llama.cpp 9ca79d5c - kv cache slot search improvements (#3493)

llama.cpp
9ca79d5c - kv cache slot search improvements (#3493)