llama.cpp
MPT support in llama.cpp
#3417
Merged

MPT support in llama.cpp #3417

ggerganov merged 17 commits into ggml-org:master from jploski:mpt
jploski
jploski CUDA: added support for ggml_clamp (see also: https://github.com/gger…
b49792b0
jploski mpt : added an implementation based (mostly) on falcon integration, m…
15236e85
cebtenzzre
ggerganov ggerganov added high priority
ggerganov ggerganov added model
cebtenzzre
cebtenzzre commented on 2023-09-30
cebtenzzre
jploski mpt : protect against "clip_qkv": null in mpt-7b
84e30e89
jploski mpt : quick fix to avoid "Strange model" warning when quantizing MPT …
00e8c5c5
jploski
cebtenzzre
cebtenzzre commented on 2023-09-30
cebtenzzre
cebtenzzre commented on 2023-09-30
jploski mpt : addendum to changeset:84e30e8 - leave parameter clamp_kqv out f…
1be89c40
cebtenzzre
cebtenzzre commented on 2023-09-30
jploski mpt : standardized all tensor names to follow GGUF spec
26c253ed
jploski mpt : addendum to changeset:1be89c40 - use "req" parameter of GGUF_GE…
df072d2d
cebtenzzre
cebtenzzre commented on 2023-10-01
jploski mpt : fixed comment s/gptneox/mpt/
90e7d6de
jploski mpt : remove tabs, trailing whitespace
47080129
ggerganov
ggerganov
ggerganov approved these changes on 2023-10-03
jploski mpt : removed ne01 + n_past == ne00 assertion from alibi (cuda/f32) a…
1364bcd7
jploski
ggerganov ggerganov requested a review from goerch goerch 2 years ago
ggerganov ggerganov requested a review from cebtenzzre cebtenzzre 2 years ago
cebtenzzre
cebtenzzre commented on 2023-10-04
jploski mpt : updated convert-mpt-hf-to-gguf.py to reflect changes made to co…
7d6a24aa
cebtenzzre Merge branch 'master' of https://github.com/ggerganov/llama.cpp into …
292363e5
cebtenzzre cebtenzzre force pushed from 39fa4be3 to 292363e5 2 years ago
ggerganov
jploski
cebtenzzre comment out n_past instead of marking it unused
ad3c2f3b
jploski mpt : removed hardcoded +178 from convert script in favor of utilizin…
1a454eb5
cebtenzzre mpt : remove unused tokenizer_json in convert script
32172f12
ggerganov ggml : remove obsolete n_past assert in ggml_alibi
96cf3f5d
ggerganov llama : print clam_kqv and max_alibi_bias hparams
9b66378c
ggerganov ggerganov merged f5f9121d into master 2 years ago
goerch
goerch commented on 2023-10-10
maddes8cht
maddes8cht

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone