Hp/split/load model (test CI) #4
split: support in llama_model_loader
7c64fef9
Avoir copying the entire vector
b8feff41
split: move llama_tensor_offset to llama_model_loader
18ff6ca8
Merge branch 'master' into hp/split/load-model
60a87ae0
llama_model_loader: PR feedbacks:
1892ae7e
avoid copying the entire vector
00381b07
Simplify this by making these optional, switch some layer creation te…
c34a5dee
Handle optional tensors
1c931f3d
llama_model_loader: fail if backend cannot allocate buffer
d8b567d2
fix mmap buffer management
02020b04
test new llama_split_prefix
93069368
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub