llama.cpp
llama : refactor graph build code
#3837
Merged

llama : refactor graph build code #3837

ggerganov merged 21 commits into master from llama-refactor
ggerganov
ggerganov llama : factor out ggml-alloc from graph graph build functions
8b2420d2
ggerganov metal : disable kernel load log
5946d98f
ggerganov llama : factor out tensor offloading outside the build call (wip)
38aca9e1
ggerganov llama : offload rest of the models
83d2c437
ggerganov llama : update offload log messages to print node index
3af87713
ggerganov llama : comments
51c4f9ee
cebtenzzre
cebtenzzre commented on 2023-10-28
slaren
slaren commented on 2023-10-29
ggerganov llama : support offloading result_norm + comments
4e98897e
ggerganov llama : factor graph input into a function
0dc05b84
ggerganov llama : do tensor offload only with CUDA
e14aa461
ggerganov llama : fix res_norm offloading
79617902
ggerganov llama : try to optimize offloading code
b4ad03b3
ggerganov ggerganov force pushed from 66a54bfe to b4ad03b3 2 years ago
ggerganov llama : fix non-CUDA build
25cfbf67
ggerganov llama : try to fix build
739b85c9
ggerganov llama : move refact in correct place + optimize graph input
da936188
ggerganov llama : refactor tensor offloading as callback
1e9c5443
ggerganov llama : add layer index to all tensor names
8925cf9e
ggerganov llama : add functional header
76108793
ggerganov llama : comment
79ad7344
ggerganov llama : remove obsolete map for layer counting
210e6e5d
ggerganov ggerganov added refactoring
ggerganov llama : add llm_build helper functions (#3848)
5baefef4
ggerganov ggerganov marked this pull request as ready for review 2 years ago
ggerganov
ggerganov Merge branch 'master' into llama-refactor
afb39292
ggerganov ggerganov merged 71e3718a into master 2 years ago
jxy
cebtenzzre
ggerganov
Galunid

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone