PR #8332 Refactor lora adapter support

Refactor lora adapter support #8332

ngxson merged 42 commits into ggml-org:master from ngxson:xsn/fix_lora

lora: load to devide buft

67c5e14d

add patch tensor function

e9d7b6c0

correct tensor patch

4e28ad40

llama_lora_adapter_apply

1b4ffbac

ngxson requested a review from

slaren 1 year ago

github-actions added ggml

correct ggml_backend_tensor_copy

b88ce0f8

add llm_build_mm

f6d090d7

Merge branch 'master' into xsn/fix_lora

a1666aaa

fix auto merge

30faf1f3

slaren commented on 2024-07-07

ggerganov commented on 2024-07-08

update based on review comments

79e29827

add convert script

847135aa

no more transpose A

712fecba

add f16 convert

84288ff9

Merge branch 'master' into xsn/fix_lora

41ced241

add metadata check

0e161889

ngxson requested a review from

slaren 1 year ago

ngxson requested a review from

ggerganov 1 year ago

add sanity check

6c617e20

fix ftype

7a83f200

add requirements

d52455f2

fix requirements

802565ca

fix outfile

95b3eb05

Merge pull request #8 from ngxson/xsn/fix_lora_convert

03d24cae

github-actions added python

slaren commented on 2024-07-09

amr3k approved these changes on 2024-07-09

compilade commented on 2024-07-09

conversion: only allow selected models

ee2b35c6

fix types

713665db

cuda : do not use dmmv if the tensor does not have enough cols

f15167a4

llama : lora fixes

9841fbda

Merge pull request #9 from ggerganov/sl/fix_fix_lora

4fe0861a

do not disable mmap with lora

1faf7e5b

Merge branch 'master' into xsn/fix_lora

e68344cb

llm_build_lora_mm_id

916e9592

convert_lora : MoE LoRA conversion support

9d96328b

mofosyne added Review Complexity : Medium

convert_hf : simplify modify_tensors for InternLM2

8956543c

llama : use llm_build_lora_mm in most model graphs

87301bdd

ggerganov approved these changes on 2024-07-15

Merge branch 'master' into xsn/fix_lora

703573f6

auto scale

42415a48

Revert "auto scale"

5b181182

remove redundant params

f68d0924

ngxson requested a review from

slaren 1 year ago

Merge branch 'master' into xsn/fix_lora

b704448a

slaren commented on 2024-07-15

Apply suggestions from code review

9175f4b7

change kv metadata

0ba23bad

slaren approved these changes on 2024-07-15

ngxson added merge ready

compilade commented on 2024-07-15

ngxson removed merge ready

move add_type to __init__

b1c40695

Merge branch 'master' into xsn/fix_lora

4d9ac0f3

convert_hf : move add_type to main()

d09382fa

Merge branch 'master' into xsn/fix_lora

383b6bce

compilade commented on 2024-07-15

ngxson added merge ready

compilade approved these changes on 2024-07-15

ngxson merged 97bdd26e into master 1 year ago

ltoniazzi commented on 2024-07-25

Reviewers

compilade

slaren

ggerganov

amr3k

ltoniazzi

Assignees

No one assigned

Labels

python Review Complexity : Medium ggml merge ready

Milestone

No milestone

llama.cpp Refactor lora adapter support #8332 Merged

Refactor lora adapter support #8332

llama.cpp
Refactor lora adapter support
#8332

Merged