llama.cpp
server : pad small embedding batches
#13692
Merged

ggerganov: server : pad small embedding batches (3c16df11)
ggerganov requested a review from ngxson 206 days ago
ngxson approved these changes on 2025-05-21
github-actions added labels: examples, server
aviallon
ggerganov merged cc74d5be into master 205 days ago
ggerganov deleted the gg/server-fix-pooling-small-batches branch 205 days ago
aviallon
