PR #7225 Add phi3 128K model support

Add phi3 128K model support #7225

ggerganov merged 22 commits into ggml-org:master from liuwei-git:master

mofosyne added model

mofosyne added Review Complexity : High

ggerganov commented on 2024-05-12

mofosyne marked this pull request as draft 1 year ago

ggerganov marked this pull request as ready for review 1 year ago

ggerganov requested a review from

slaren 1 year ago

slaren commented on 2024-05-16

add phi3 128k support in convert-hf-to-gguf

8fa413d8

add phi3 128k support in cuda

56d9fa72

address build warnings on llama.cpp

cc19780a

adjust index value in cuda long rope freq factors

9f871298

add long rope support in ggml cpu backend

c5569311

make freq factors only depend on ctx size

6333ed1a

remove unused rope scaling type 'su' frin gguf converter

5683db3b

fix flint warnings on convert-hf-to-gguf.py

b1f491a2

set to the short freq factor when context size is small than trained …

d05ae12e

add one line of comments

8a9c897f

metal : support rope freq_factors

2d473a4a

ggml : update ggml_rope_ext API to support freq. factors

471d8170

backends : add dev messages to support rope freq. factors

352c3859

minor : style

f4cb482c

tests : update to use new rope API

e7c7d8ca

backends : fix pragma semicolons

4f787ead

minor : cleanup

d93b5cad

llama : move rope factors from KV header to tensors

600896b8

ggerganov force pushed to 600896b8 1 year ago

llama : remove tmp assert

23b72b87

ggerganov approved these changes on 2024-05-21

github-actions added testing

github-actions added Nvidia GPU

github-actions added Vulkan

github-actions added examples

github-actions added python

github-actions added ggml

github-actions added SYCL

cuda : fix compile warning

e9acbce6

mofosyne added merge ready

convert : read/write n_head_kv

92711138

llama : fix uninitialized tensors

7528c705

ggerganov merged 201cc11a into master 1 year ago

Reviewers

ggerganov

compilade

slaren

Assignees

No one assigned

Labels

model testing Nvidia GPU Vulkan examples python Review Complexity : High ggml merge ready SYCL

Milestone

No milestone

llama.cpp Add phi3 128K model support #7225 Merged

Add phi3 128K model support #7225

llama.cpp
Add phi3 128K model support
#7225

Merged