llama.cpp
03bf161e - llama : support batched embeddings (#5466)

Commit
1 year ago
llama : support batched embeddings (#5466)

* batched embedding: pool outputs by sequence id. updated embedding example

* bring back non-causal attention

* embd : minor improvements

* llama : minor

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>