llama.cpp
llama: implement YaRN RoPE scaling
#2268
Merged

llama: implement YaRN RoPE scaling #2268

cebtenzzre merged 36 commits into ggml-org:master from cebtenzzre:ntkv2
cebtenzzre
cebtenzzre cebtenzzre force pushed 2 years ago
cebtenzzre cebtenzzre force pushed 2 years ago
cebtenzzre cebtenzzre force pushed 2 years ago
cebtenzzre cebtenzzre changed the title llama: implement NTK-By-Parts (NTKv2) llama: implement NTK-By-Parts (NTKv2) RoPE scaling 2 years ago
cebtenzzre cebtenzzre force pushed 2 years ago
FNsi
cebtenzzre
cebtenzzre cebtenzzre force pushed 2 years ago
cebtenzzre cebtenzzre marked this pull request as ready for review 2 years ago
cebtenzzre
ggerganov
ggerganov commented on 2023-07-22
ggerganov
cebtenzzre llama: implement NTK-By-Parts (NTKv2) RoPE scaling
8dec38c3
cebtenzzre CUDA implementation
6aeb46b3
cebtenzzre Metal implementation
9348aa4d
cebtenzzre cebtenzzre force pushed to 9348aa4d 2 years ago
cebtenzzre
cebtenzzre
cebtenzzre
cebtenzzre implement new YaRN algorithm
a30ae209
cebtenzzre cebtenzzre changed the title llama: implement NTK-By-Parts (NTKv2) RoPE scaling llama: implement YaRN RoPE scaling 2 years ago
cebtenzzre
bloc97
cebtenzzre
KerfuffleV2
Green-Sky
cebtenzzre cebtenzzre marked this pull request as draft 2 years ago
bloc97
cebtenzzre Merge branch 'master' of https://github.com/ggerganov/llama.cpp into …
b5ced4fb
cebtenzzre ggml : increase GGML_MAX_OP_PARAMS
826269ad
cebtenzzre YaRN : avoid NaN if unused betas are zero
cf731d56
cebtenzzre YaRN : fix missing parameter in CUDA impl
dcb058ce
cebtenzzre convert : reduce unnecessary variables in Params
281b26e6
cebtenzzre Merge branch 'master' of https://github.com/ggerganov/llama.cpp into …
a06c7292
cebtenzzre llama : simplify use of context params
dc26a0dd
cebtenzzre llama : store YaRN parameters in GGUF
904d4edf
cebtenzzre fix convert scripts
56abb9a4
cebtenzzre cebtenzzre force pushed to 56abb9a4 2 years ago
cebtenzzre llama : fix C compatibility
43eaf06a
cebtenzzre don't hardcode max_pos_emb
fe788c45
cebtenzzre cebtenzzre marked this pull request as ready for review 2 years ago
Green-Sky
Green-Sky commented on 2023-09-21
Green-Sky
Green-Sky commented on 2023-09-21
cebtenzzre address review comments
e0b120c3
cebtenzzre
cebtenzzre commented on 2023-09-21
cebtenzzre restore backwards compatiblity with *.rope.scale_linear
19bb74e7
cebtenzzre better option descriptions in help
4d5fe734
cebtenzzre gguf : store scaling type as a string instead of an int
74664157
cebtenzzre improve printing of YaRN parameters
4f4e9480
cebtenzzre allow forcing ext_factor to zero if scaling type is YaRN
5d7a3a5c
cebtenzzre Merge branch 'master' of https://github.com/ggerganov/llama.cpp into …
9bd050f1
cebtenzzre
ggerganov
ggerganov ggerganov added demo
cebtenzzre
cebtenzzre fix rope_cuda parameter order
babf0e0c
cebtenzzre default n_yarn_orig_ctx to n_ctx_train
0050e1ec
cebtenzzre fix uninitialized cparams
09c31027
cebtenzzre cebtenzzre force pushed to 09c31027 2 years ago
cebtenzzre make printed param formatting more consistent
57c3442e
cebtenzzre
cebtenzzre fix missing import
a20b3e6c
bloc97
cebtenzzre
bloc97
cebtenzzre
bloc97
cebtenzzre
cebtenzzre Merge branch 'master' of https://github.com/ggerganov/llama.cpp into …
9ef91b13
ggerganov
cebtenzzre
jquesnelle
bloc97
bloc97
bloc97
jquesnelle Fix YaRN inverted scaling and add "rope.scaling.type" to GGUF (#1)
9ae10b3a
jquesnelle
jquesnelle fix YaRN ramp, make mscale conditional, add --yarn-orig-ctx (#2)
14cf93b1
cebtenzzre Merge branch 'master' of https://github.com/ggerganov/llama.cpp into …
237f1e79
cebtenzzre
cebtenzzre Merge branch 'master' of https://github.com/ggerganov/llama.cpp into …
bc8395d5
bloc97
cebtenzzre Merge branch 'master' of https://github.com/ggerganov/llama.cpp into …
4d5ed834
cebtenzzre
ggerganov
cebtenzzre
bloc97
ggerganov
ggerganov approved these changes on 2023-10-28
ggerganov
bloc97
cebtenzzre
ggerganov
bloc97
ggerganov
jquesnelle fix loading rope.scaling.original_context_length from GGUF (#3)
9fc82382
ggerganov
cebtenzzre implement YaRN for GPT-NeoX RoPE
15f26efd
cebtenzzre Merge branch 'master' of https://github.com/ggerganov/llama.cpp into …
081f7381
cebtenzzre cebtenzzre merged 898aeca9 into master 2 years ago
cebtenzzre cebtenzzre deleted the ntkv2 branch 2 years ago
slaren
redthing1
cebtenzzre
LostRuins
redthing1
cebtenzzre cebtenzzre restored the head branch 2 years ago
cebtenzzre
cebtenzzre cebtenzzre deleted the ntkv2 branch 2 years ago
IridiumMaster
ggerganov
LostRuins
IridiumMaster
cebtenzzre
jxy
jxy
jxy
jxy
cebtenzzre
KerfuffleV2
jxy
jxy
FNsi
ggerganov
FNsi
Green-Sky
ggerganov
Dampfinchen
KerfuffleV2
bloc97
cebtenzzre cebtenzzre removed demo

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone