llama.cpp
Add phi3 128K model support
#7225
Merged

Add phi3 128K model support #7225

ggerganov merged 22 commits into ggml-org:master from liuwei-git:master
liuwei-git
mofosyne mofosyne added model
mofosyne mofosyne added Review Complexity : High
ggerganov
ggerganov commented on 2024-05-12
liuwei-git
mofosyne mofosyne marked this pull request as draft 1 year ago
ggerganov
ggerganov
ggerganov ggerganov marked this pull request as ready for review 1 year ago
ggerganov ggerganov requested a review from slaren slaren 1 year ago
liuwei-git
slaren
ggerganov
slaren
slaren commented on 2024-05-16
liuwei-git add phi3 128k support in convert-hf-to-gguf
8fa413d8
liuwei-git add phi3 128k support in cuda
56d9fa72
liuwei-git address build warnings on llama.cpp
cc19780a
liuwei-git adjust index value in cuda long rope freq factors
9f871298
liuwei-git add long rope support in ggml cpu backend
c5569311
liuwei-git make freq factors only depend on ctx size
6333ed1a
liuwei-git remove unused rope scaling type 'su' frin gguf converter
5683db3b
liuwei-git fix flint warnings on convert-hf-to-gguf.py
b1f491a2
liuwei-git set to the short freq factor when context size is small than trained …
d05ae12e
liuwei-git add one line of comments
8a9c897f
ggerganov metal : support rope freq_factors
2d473a4a
ggerganov ggml : update ggml_rope_ext API to support freq. factors
471d8170
ggerganov backends : add dev messages to support rope freq. factors
352c3859
ggerganov minor : style
f4cb482c
ggerganov tests : update to use new rope API
e7c7d8ca
ggerganov backends : fix pragma semicolons
4f787ead
ggerganov minor : cleanup
d93b5cad
ggerganov llama : move rope factors from KV header to tensors
600896b8
ggerganov ggerganov force pushed to 600896b8 1 year ago
ggerganov llama : remove tmp assert
23b72b87
ggerganov
ggerganov approved these changes on 2024-05-21
github-actions github-actions added testing
github-actions github-actions added Nvidia GPU
github-actions github-actions added Vulkan
github-actions github-actions added examples
github-actions github-actions added python
github-actions github-actions added ggml
github-actions github-actions added SYCL
github-actions
ggerganov cuda : fix compile warning
e9acbce6
mofosyne mofosyne added merge ready
slaren
ggerganov
slaren
ggerganov convert : read/write n_head_kv
92711138
dillfrescott
ggerganov llama : fix uninitialized tensors
7528c705
ggerganov ggerganov merged 201cc11a into master 1 year ago
RonanKMcGovern
AlessandroW
RonanKMcGovern
RachidAR

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone