llama.cpp
feat: add potential to run Jina Embeddings architecture
#6826
Merged

feat: add potential to run Jina Embeddings architecture #6826

JoanFM
JoanFM feat: first things to do
86a5d96f
JoanFM feat: create tensors for Jina architecture
747d17a6
JoanFM fix: use other tensors
a40156a0
JoanFM feat: embedding gets results
b00d38b0
JoanFM fix: fix usage of ALIBI
cf1c1447
JoanFM fix: clean prints
63a1d7c0
JoanFM fix: do some cleanup unused vars
c229e489
JoanFM fix: revert changes to Makefile and CMakeLists
e2323706
JoanFM fix: revert some changes
795ff1d3
JoanFM fix: fix small detail
d6ac931b
JoanFM JoanFM changed the title Feat jina embeddings (DRAFT) feat: add potential to run Jina Embeddings architecture 1 year ago
JoanFM Merge branch 'master' into feat-jina-embeddings
db7e8ce5
JoanFM fix: fix convert formatting
c1c0f4d8
JoanFM fix: fix linting and editor
64cd4b13
JoanFM feat: set proper vocab settings
71ff763e
JoanFM JoanFM marked this pull request as ready for review 1 year ago
JoanFM JoanFM changed the title (DRAFT) feat: add potential to run Jina Embeddings architecture feat: add potential to run Jina Embeddings architecture 1 year ago
JoanFM
JoanFM fix: JinaBertForMaskedLM registration
d7d6a4ed
JoanFM JoanFM force pushed from e946cb09 to d7d6a4ed 1 year ago
ggerganov
JoanFM feat: support q_normalization and k_normalization in Jina arch
cde49b74
JoanFM feat: handle gpt2 tokenizer with Jina architecture
dd060a2a
JoanFM feat: example comments in embedding
dfa06763
JoanFM feat: rename Jina Bert to Jina Bert V2
c3f4b1f2
JoanFM Merge branch 'master' into feat-jina-embeddings
f8d17090
JoanFM
JoanFM fix: add some changes as per review
d9b8dd66
JoanFM JoanFM force pushed from da963685 to d9b8dd66 1 year ago
ggerganov
JoanFM JoanFM marked this pull request as draft 1 year ago
JoanFM
github-actions
JoanFM feat: proper KQ_pos for Jina embeddings
14073a2c
JoanFM JoanFM marked this pull request as ready for review 1 year ago
JoanFM
JoanFM Merge branch 'master' of https://github.com/JoanFM/llama.cpp into fea…
76436c19
JoanFM
JoanFM feat: add capacity to load models ES and DE for Spanish
cf9fcd83
JoanFM Merge branch 'master' into feat-jina-embeddings
e59b5465
ggerganov ggerganov requested a review from ggerganov ggerganov 1 year ago
ggerganov llama : fix pre-tokenizers
b7ede482
JoanFM Merge branch 'master' of https://github.com/JoanFM/llama.cpp into fea…
8e36fd5a
JoanFM Merge branch 'master' of https://github.com/JoanFM/llama.cpp into fea…
849aeda2
JoanFM
ggerganov
JoanFM Merge branch 'master' of https://github.com/JoanFM/llama.cpp into fea…
ee3250da
mofosyne mofosyne added Review Complexity : High
mofosyne mofosyne added enhancement
ggerganov
ggerganov ggml : full ALiBi support
7fdca334
ggerganov ggml : update ggml_soft_max_ext() CUDA, SYCL
d0592d49
ggerganov ggml : ggml_flash_attn_ext() support ALiBi (CPU)
166e60bf
ggerganov ggml : ggml_flash_attn_ext() support ALiBi (Metal)
97c27f59
ggerganov ggml : fix warning
f7055d31
ggerganov ggml : ggml_flash_attn_ext() support ALiBi (CUDA)
865af990
ggerganov Merge remote-tracking branch 'origin/gg/refactor-alibi-2' into HEAD
d9adb883
ggerganov ggerganov changed the base branch from master to gg/refactor-alibi-2 1 year ago
ggerganov minor : clean-up
a1278f13
ggerganov ggerganov changed the base branch from gg/refactor-alibi-2 to master 1 year ago
ggerganov Merge branch 'master' into HEAD
23499b81
ggerganov
ggerganov approved these changes on 2024-05-11
ggerganov embedding : add warning about missing SEP
49b3dbbe
ggerganov ggerganov merged b83cc3f5 into master 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone