llama.cpp
server: Ensure batches are either all embed or all completion (#8076)
#8420
Merged

ggerganov merged 2 commits into ggml-org:master from iamlemec:server-embed
iamlemec: make sure batches are all embed or all non-embed (commit 40c99abb)
github-actions added the examples and server labels
iamlemec changed the title from "server: Ensure batches are either all embed or all completion" to "server: Ensure batches are either all embed or all completion (#8076)" 1 year ago
compilade commented on 2024-07-10
iamlemec: non-embedding batch for sampled tokens; fix unused params warning (commit 371cb8df)
ggerganov approved these changes on 2024-07-12
ggerganov merged c3ebcfa1 into master 1 year ago
