llama.cpp
f13847cf - server: fix regression on streamed non-chat completion w/ stops (#13785)

Commit
137 days ago
server: fix regression on streamed non-chat completion w/ stops (#13785)

* more forgiving message diffs: partial stop words aren't erased, full stops are
* Add (slow) server test for completion + stream + stop
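The rule in the first bullet (a complete stop word is cut from the output, a partially matched one is left alone) can be illustrated with a small sketch. This is not the llama.cpp implementation; the function name `trim_at_full_stop` and its signature are hypothetical, and it only assumes that stop words are plain strings matched against the accumulated text.

```python
def trim_at_full_stop(text: str, stops: list[str]) -> tuple[str, bool]:
    """Cut `text` at the earliest complete stop word.

    Hypothetical helper for illustration only. A stop word that has only
    partially appeared at the end of `text` is left untouched.
    Returns (possibly trimmed text, whether a full stop word was found).
    """
    cut = -1
    for stop in stops:
        if not stop:
            continue  # ignore empty stop strings
        idx = text.find(stop)
        if idx != -1 and (cut == -1 or idx < cut):
            cut = idx  # keep the earliest full match
    if cut == -1:
        return text, False
    return text[:cut], True


# Partial stop word ("wor" of "world"): nothing is erased.
partial = trim_at_full_stop("Hello wor", ["world"])
# Full stop word: the text is cut where the stop word begins.
full = trim_at_full_stop("Hello world!", ["world"])
```

In a streaming setting the same check would be applied to the accumulated text after each chunk, so that the final output never contains a complete stop word.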