llama.cpp
e3a2c3fe
- server : use refs + use llama_batch_clear()
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
server : use refs + use llama_batch_clear()
References
#3677 - server : parallel decoding and multimodal (cont)
Author
ggerganov
Parents
3d5929e8
Loading