llama.cpp
llama : fix embeddings
#5796
Merged

llama : fix embeddings #5796

ggerganov merged 9 commits into master from gg/fix-embeddings
ggerganov
cebtenzzre
cebtenzzre
cebtenzzre commented on 2024-02-29
ggerganov ggerganov marked this pull request as draft 1 year ago
iamlemec
iamlemec commented on 2024-02-29
tybalex
iamlemec
ggerganov llama : fix embeddings
d0347840
ggerganov ggerganov force pushed from 008f3fc7 to d0347840 1 year ago
ggerganov
ggerganov commented on 2024-03-04
ggerganov llama : do not use KV cache for non-causal models
eb425962
ggerganov embeddings : fix llama_batch_init arg
9bbeb0f1
ggerganov llama : add pooling switch
e66da356
ggerganov ggerganov marked this pull request as ready for review 1 year ago
cebtenzzre
cebtenzzre commented on 2024-03-04
iamlemec
iamlemec commented on 2024-03-04
cebtenzzre
ggerganov
cebtenzzre
iamlemec
iamlemec commented on 2024-03-04
ggerganov llama : distinguish token vs sequence embeddings
79e4eede
ggerganov llama : assert pooling tensor
fc9af156
ggerganov
ggerganov llama : simplify causal mask condition
c23c5547
ggerganov llama : assert input batch with pooling enabled
1af2d061
ggerganov readme : update API changes list
7cafaa47
ggerganov ggerganov merged 29ae62d2 into master 1 year ago
ggerganov ggerganov deleted the gg/fix-embeddings branch 1 year ago
iamlemec
cebtenzzre
cebtenzzre commented on 2024-03-04
ggerganov
iamlemec
ggerganov

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone