llama.cpp
Streamline embeddings from "non-embedding" models
#8087
Merged

iamlemec force pushed 1 year ago
iamlemec requested a review from compilade 1 year ago
ggerganov commented on 2024-06-24
mofosyne added the label Review Complexity : Low
compilade approved these changes on 2024-06-24
ggerganov approved these changes on 2024-06-25
compilade approved these changes on 2024-06-27
iamlemec fix microbatch output counting; add attention_type context parameter
9fa007c8
iamlemec force pushed to 9fa007c8 1 year ago
ggerganov merged d12f7810 into master 1 year ago


Assignees: No one assigned