llama.cpp
a8063171
- server : completion requests remember slot_id
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
server : completion requests remember slot_id
References
#3677 - server : parallel decoding and multimodal (cont)
Author
ggerganov
Parents
f305d643
Loading