llama.cpp
Add llama 3.1 rope scaling factors to llama conversion and inference
#8676
Merged


ggerganov merged 6 commits into ggml-org:master from jmorganca:master
jmorganca
github-actions added python
jmorganca
jmorganca commented on 2024-07-24
Galunid
Galunid commented on 2024-07-24
jmorganca marked this pull request as draft 1 year ago
jmorganca
github-actions added Nvidia GPU
jxy
jmorganca marked this pull request as ready for review 1 year ago
jxy
jxy commented on 2024-07-24
jmorganca force pushed from 2f4809bd to 3f53dfe4 1 year ago
jxy
jmorganca force pushed from 3f53dfe4 to 72690676 1 year ago
jmorganca
compilade
compilade commented on 2024-07-25
kallewoof
MoonRide303
tristandruyen
Nexesenex
MoonRide303
LostRuins
MoonRide303
LostRuins
kallewoof
schmorp
compilade
compilade commented on 2024-07-25
jmorganca
ggerganov
gilbertgong
3Simplex
jmorganca
gilbertgong
ggerganov
jmorganca
ggerganov
ggerganov requested a review from compilade 1 year ago
m18coppola
qnixsynapse
LostRuins
slaren
bartowski1182
bartowski1182 commented on 2024-07-26
gilbertgong
oldgithubman
m18coppola
compilade
compilade commented on 2024-07-26
compilade
jmorganca Add llama 3.1 rope scaling factors to llama conversion and inference
e6bacb40
jmorganca Update convert_hf_to_gguf.py
24540dd2
jmorganca address comments
1a3a1b6d
jmorganca address comments
90fd87df
jmorganca force pushed from a946b40e to 90fd87df 1 year ago
compilade
compilade commented on 2024-07-26
ddh0
compilade
compilade approved these changes on 2024-07-27
compilade
ddh0
jmorganca Update src/llama.cpp
e6d5bed7
jmorganca Update convert_hf_to_gguf.py
658041d1
ggerganov merged b5e95468 into master 1 year ago


Assignees
No one assigned