llama.cpp
server : refactor slot input data, move tokenizer to HTTP thread
#10023
Merged

server : refactor slot input data, move tokenizer to HTTP thread #10023

ngxson
ngxson server : refactor slot input data, move tokenizer to HTTP thread
125835b2
github-actions github-actions added examples
github-actions github-actions added server
ngxson move prompt_tokens.empty() check
5c749bea
ngxson Merge branch 'master' into xsn/refactor_server_slot_input
3abc3396
ngxson fix incorrect if branch
60d4194b
ngxson fix infinite generation loop
b550011b
ngxson
ggerganov
ngxson bring back infill validation
cff97ad3
ngxson add infill test
fea5ca45
ngxson try fixing format_infill
07381f7d
ngxson fix test
c34ab08a
github-actions github-actions added python
ngxson
ngxson remove redundant code
575b1332
ngxson rename completion to inference
4a9f3e76
ngxson update docs
13ee7793
ngxson ngxson marked this pull request as ready for review 351 days ago
ngxson ngxson requested a review from ggerganov ggerganov 351 days ago
ngxson use llama_tokens everywhere
7f7acdbe
ggerganov
ggerganov
ggerganov approved these changes on 2024-10-24
wwoodsTM
ngxson ngxson merged 958367bf into master 351 days ago
chrisstankevitz
ngxson
chrisstankevitz
ngxson
chrisstankevitz
chrisstankevitz
ngxson
ngxson commented on 2024-10-31
sasha0552

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone