llama.cpp
113dd600
- server : bach has to be allocated for n_parallel sequences
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
server : bach has to be allocated for n_parallel sequences
References
#3677 - server : parallel decoding and multimodal (cont)
Author
ggerganov
Parents
6b2437e3
Loading