llama.cpp
Add support for BERT embedding models
#5423
Merged

Add support for BERT embedding models #5423

iamlemec merged 21 commits into ggml-org:master from iamlemec:bert
iamlemec
cebtenzzre BERT WIP
7286b83d
iamlemec merge from master
ef10d786
iamlemec it runs; tokenization is messed up; pooling is wrong for multi batches
0051c82d
iamlemec add in wordpiece tokenizer
59c1829b
iamlemec put causal_attn flag in gguf
5f1c21d0
iamlemec Merge remote-tracking branch 'origin/master' into bert
e0e14e31
iamlemec Merge remote-tracking branch 'upstream/master' into bert
7218c7b6
cebtenzzre
cebtenzzre commented on 2024-02-08
iamlemec Update convert-hf-to-gguf.py
e3efcf13
iamlemec add causal attention gguf key
96d37f8d
slaren
slaren commented on 2024-02-08
cebtenzzre use ctx_output for tok_norm of BERT and BLOOM
e78388d3
cebtenzzre bert : add some missing graph callbacks
b14c457f
iamlemec fix up model sizing and result acquisition
68758083
iamlemec hard-code token_type = 0
d080bebc
iamlemec Merge branch 'bert' of github.com:iamlemec/llama.cpp into bert
3a1895d7
ggerganov
ggerganov
ggerganov commented on 2024-02-09
iamlemec
iamlemec style fixes
961e98f2
iamlemec undo attempted type_embd simplify
56afb2f6
cebtenzzre bert : simplify token type embedding access
ab49e9ee
cebtenzzre flake8 : add W503 to ignore list
6972e7e9
iamlemec
ggerganov
ggerganov commented on 2024-02-11
ggerganov minor : code style normalization
8fbefed1
ggerganov
ggerganov approved these changes on 2024-02-11
iamlemec avoid use of ggml_graph_get_tensor
e379e8c1
iamlemec Merge branch 'bert' of github.com:iamlemec/llama.cpp into bert
61bab478
iamlemec iamlemec merged 2891c8aa into master 1 year ago
Mihaiii
ggerganov
cebtenzzre
slaren
cebtenzzre
adrianliechti
iamlemec
cebtenzzre
adrianliechti
Hirtol
iamlemec
Solido
cebtenzzre
Solido
cebtenzzre
Solido
cebtenzzre
astrowonk
ditsuke
astrowonk
ditsuke
iamlemec
mofanke
cebtenzzre
mofanke
hiepxanh
mofanke
jkgenser
beyondskyway
ggerganov
mofosyne mofosyne added enhancement
mofosyne mofosyne added model
mofosyne mofosyne added Review Complexity : High
iamlemec
ggerganov
sragrawal
iamlemec
grigohas
iacore
grigohas
iacore

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone