llama.cpp
llama.cpp : split llama_context_params into model and context params
#3301
Merged

ggerganov merged 17 commits into master from llama-model-params
slaren llama.cpp : split llama_context_params into model and context params
cf1f8059
slaren added breaking change
slaren added refactoring
slaren fix metal build
39f4afac
slaren fix freq_base/scale default to model value
f28e4953
slaren llama-bench : keep the same model between tests when possible
96f6dcde
slaren move n_threads to llama_context_params, add n_threads_batch
7f953792
slaren fix mpi build
d41b53ca
slaren marked this pull request as ready for review 2 years ago
slaren remove kv_size(), cuda scratch fixes
ceb18e44
JohannesGaessler
slaren
JohannesGaessler
slaren remove low-vram option
92fb8ab5
ggerganov approved these changes on 2023-09-22
ggerganov added high priority
netrunnereve
slaren add n_threads_batch to system info, refactor to get_system_info()
a6084cc7
slaren add documentation about --threads-batch to the READMEs
34696841
slaren llama-bench fix
e5afe420
slaren main : fix rope freq/scale warning
0e9ed7f8
slaren llama.cpp : add llama_get_model
8f5b0eaa
slaren Merge remote-tracking branch 'origin/master' into llama-model-params
65b83f37
slaren remove duplicated ctx/model functions
5659391b
slaren commented on 2023-09-28
slaren cuda : print total VRAM used
17e841ac
slaren Merge remote-tracking branch 'origin/master' into llama-model-params
c8a9658e
ggerganov merged 16bc66d9 into master 1 year ago
ggerganov deleted the llama-model-params branch 1 year ago
