Streamline embeddings from "non-embedding" models #8087
compilade
approved these changes
on 2024-06-24
ggerganov
approved these changes
on 2024-06-25
compilade
approved these changes
on 2024-06-27
fix microbatch output counting; add attention_type context parameter
9fa007c8
iamlemec force-pushed to 9fa007c8 1 year ago
ggerganov merged d12f7810 into master 1 year ago
Assignees
No one assigned
Labels
Review Complexity : Low