llama.cpp
Threadpool: take 2
#8672
Merged


slaren merged 48 commits into ggml-org:master from CodeLinaro:threadpool
github-actions added testing
github-actions added examples
github-actions added server
github-actions added ggml
fmz force pushed 1 year ago
fmz force pushed 1 year ago
fmz force pushed 1 year ago
fmz force pushed 1 year ago
fmz force pushed 1 year ago
fmz force pushed 1 year ago
mofosyne added Review Complexity : Medium
fmz force pushed to 4aa7a727 1 year ago
fmz force pushed from 4aa7a727 1 year ago
fmz force pushed 1 year ago
fmz force pushed to 22015290 1 year ago
max-krasnyansky force pushed from c1491e52 1 year ago
max-krasnyansky force pushed 1 year ago
fmz force pushed 1 year ago
max-krasnyansky force pushed 1 year ago
max-krasnyansky force pushed 1 year ago
slaren commented on 2024-08-24
max-krasnyansky force pushed to c083ca31 1 year ago
max-krasnyansky force pushed 1 year ago
max-krasnyansky force pushed 1 year ago
max-krasnyansky force pushed 1 year ago
slaren commented on 2024-08-26
fmz Introduce ggml_compute_threadpool
130adf84
fmz Minor fixes
a0aae528
fmz fixed use after release bug
d5c9c14d
fmz fixed a harmless race condition
82224f84
fmz Fix Android build issue
817eaf0c
fmz fix more race conditions
57637326
fmz fix deadlock for cases where cgraph.n_nodes == 1
3008b31b
max-krasnyansky threadpool: use cpu_get_num_math to set the default number of threadp…
96d6603d
max-krasnyansky bench: create fresh threadpool for each test
29534415
max-krasnyansky atomics: always use stdatomics with clang and use relaxed memory orde…
6fcc780b
max-krasnyansky threadpool: make polling the default to match openmp behavior
3b62f7c1
max-krasnyansky threadpool: do not wakeup threads in already paused threadpool
dfa63778
fix potential race condition in check_for_work
2e18f0d4
max-krasnyansky threadpool: do not create two threadpools if their params are identical
48aa8eec
max-krasnyansky threadpool: reduce pause/resume/wakeup overhead in common cases
494e27c7
max-krasnyansky threadpool: add support for hybrid polling
b630acdb
max-krasnyansky threadpool: reduce the number of barrier required
9d3e78c6
max-krasnyansky threadpool: remove special-casing for disposable threadpools
538bd9f7
max-krasnyansky threadpool: do not clear barrier counters between graphs computes (fi…
db45b6d3
max-krasnyansky threadpool: use relaxed order for chunk sync
307fece5
max-krasnyansky threadpool: remove abort_callback from threadpool state
63a0dad8
max-krasnyansky threadpool: better naming for thread/cpumask related functions
2358bb36
max-krasnyansky threadpool: consistent use of int type for n_threads params
4a4d7150
max-krasnyansky threadpool: add support for ggml_threadpool_params_default/init
c4452edf
max-krasnyansky threadpool: move typedef into ggml.h
31541d74
max-krasnyansky threadpool: fix apply_priority() function name
40648601
max-krasnyansky threadpool: fix swift wrapper errors due to n_threads int type cleanup
f64c9751
max-krasnyansky threadpool: enable --cpu-mask and other threadpool related options on…
c506d7fc
max-krasnyansky threadpool: replace checks for compute_thread ret code with proper st…
8008463a
max-krasnyansky threadpool: simplify threadpool init logic and fix main thread affini…
49ac51f2
max-krasnyansky threadpool: update threadpool resume/pause function names
204377a0
max-krasnyansky threadpool: enable openmp by default for now
93f170d8
max-krasnyansky threadpool: don't forget to free workers state when omp is enabled
a7496bf7
max-krasnyansky threadpool: avoid updating process priority on the platforms that do …
8186e961
max-krasnyansky threadpool: update calling thread prio and affinity only at start/resume
658f16c3
max-krasnyansky llama-bench: turn threadpool params into vectors, add output headers,…
8d5ab9a5
fmz force pushed to 8d5ab9a5 1 year ago
max-krasnyansky llama-bench: add support for cool off between tests --delay
3bcc4dee
slaren commented on 2024-08-27
max-krasnyansky threadpool: move process priority setting into the apps (bench and cli)
5d4c0a13
max-krasnyansky force pushed 1 year ago
max-krasnyansky threadpool: move all pause/resume logic into ggml
e3c22020
max-krasnyansky force pushed to e3c22020 1 year ago
max-krasnyansky threadpool: further api cleanup and prep for future refactoring
c6328bc0
max-krasnyansky threadpool: minor indent fixes
bead7d47
max-krasnyansky threadpool: improve setpriority error message
8e8f8ce4
slaren approved these changes on 2024-08-29
max-krasnyansky Update examples/llama-bench/llama-bench.cpp
c6c27b14
max-krasnyansky threadpool: fix indent in set_threadpool call
b97bd67e
max-krasnyansky use int32_t for n_thread type in public llama.cpp API
cae35b9f
max-krasnyansky threadpool: use _new and _free instead of _create and _release
c49d6340
max-krasnyansky fix two more public APIs to use int32_t for n_threads
3b5f7c2a
max-krasnyansky build: set _GNU_SOURCE for Android
52aa6772
slaren merged 42c76d13 into master 1 year ago

