PR #8672 Threadpool: take 2 - SemanticDiff

Threadpool: take 2 #8672

slaren merged 48 commits into ggml-org:master from CodeLinaro:threadpool

fmz

github-actions

github-actions added testing

github-actions

github-actions added examples

github-actions

github-actions added server

github-actions

github-actions added ggml

fmz

fmz

fmz force pushed 1 year ago

fmz

fmz force pushed 1 year ago

fmz

fmz force pushed 1 year ago

fmz

slaren

fmz

fmz

fmz

oldgithubman

fmz

fmz force pushed 1 year ago

fmz

slaren

fmz

fmz

fmz force pushed 1 year ago

fmz

fmz

fmz force pushed 1 year ago

slaren

fmz

max-krasnyansky

max-krasnyansky

mofosyne

mofosyne added Review Complexity : Medium

fmz

fmz force pushed to 4aa7a727 1 year ago

fmz

fmz force pushed from 4aa7a727 1 year ago

slaren

max-krasnyansky

fmz

fmz force pushed 1 year ago

fmz

fmz force pushed to 22015290 1 year ago

max-krasnyansky

fmz

fmz

fmz

max-krasnyansky

max-krasnyansky force pushed from c1491e52 1 year ago

max-krasnyansky

max-krasnyansky

max-krasnyansky force pushed 1 year ago

slaren

max-krasnyansky

max-krasnyansky

max-krasnyansky

slaren

cpumaxx

fmz

fmz force pushed 1 year ago

max-krasnyansky

max-krasnyansky force pushed 1 year ago

max-krasnyansky

max-krasnyansky force pushed 1 year ago

fmz

slaren

slaren commented on 2024-08-24

slaren

max-krasnyansky

max-krasnyansky

max-krasnyansky

max-krasnyansky force pushed to c083ca31 1 year ago

max-krasnyansky

max-krasnyansky force pushed 1 year ago

max-krasnyansky

max-krasnyansky force pushed 1 year ago

max-krasnyansky

max-krasnyansky force pushed 1 year ago

slaren

slaren commented on 2024-08-26

fmz

Introduce ggml_compute_threadpool

130adf84

fmz

Minor fixes

a0aae528

fmz

fixed use after release bug

d5c9c14d

fmz

fixed a harmless race condition

82224f84

fmz

Fix Android bulid issue

817eaf0c

fmz

fix more race conditions

57637326

fmz

fix deadlock for cases where cgraph.n_nodes == 1

3008b31b

max-krasnyansky

threadpool: use cpu_get_num_math to set the default number of threadp…

96d6603d

max-krasnyansky

bench: create fresh threadpool for each test

29534415

max-krasnyansky

atomics: always use stdatomics with clang and use relaxed memory orde…

6fcc780b

max-krasnyansky

threadpool: make polling the default to match openmp behavior

3b62f7c1

max-krasnyansky

threadpool: do not wakeup threads in already paused threadpool

dfa63778

fix potential race condition in check_for_work

2e18f0d4

max-krasnyansky

threadpool: do not create two threadpools if their params are identical

48aa8eec

max-krasnyansky

threadpool: reduce pause/resume/wakeup overhead in common cases

494e27c7

max-krasnyansky

threadpool: add support for hybrid polling

b630acdb

max-krasnyansky

threadpool: reduce the number of barrier required

9d3e78c6

max-krasnyansky

threadpool: remove special-casing for disposable threadpools

538bd9f7

max-krasnyansky

threadpool: do not clear barrier counters between graphs computes (fi…

db45b6d3

max-krasnyansky

threadpool: use relaxed order for chunk sync

307fece5

max-krasnyansky

threadpool: remove abort_callback from threadpool state

63a0dad8

max-krasnyansky

threadpool: better naming for thread/cpumask releated functions

2358bb36

max-krasnyansky

threadpool: consistent use of int type for n_threads params

4a4d7150

max-krasnyansky

threadpool: add support for ggml_threadpool_params_default/init

c4452edf

max-krasnyansky

threadpool: move typedef into ggml.h

31541d74

max-krasnyansky

threadpool: fix apply_priority() function name

40648601

max-krasnyansky

threadpool: fix swift wrapper errors due to n_threads int type cleanup

f64c9751

max-krasnyansky

threadpool: enable --cpu-mask and other threadpool related options on…

c506d7fc

max-krasnyansky

threadpool: replace checks for compute_thread ret code with proper st…

8008463a

max-krasnyansky

threadpool: simplify threadpool init logic and fix main thread affini…

49ac51f2

max-krasnyansky

threadpool: update threadpool resume/pause function names

204377a0

max-krasnyansky

threadpool: enable openmp by default for now

93f170d8

max-krasnyansky

threadpool: don't forget to free workers state when omp is enabled

a7496bf7

max-krasnyansky

threadpool: avoid updating process priority on the platforms that do …

8186e961

max-krasnyansky

threadpool: update calling thread prio and affinity only at start/resume

658f16c3

max-krasnyansky

llama-bench: turn threadpool params into vectors, add output headers,…

8d5ab9a5

fmz

fmz force pushed to 8d5ab9a5 1 year ago

max-krasnyansky

llama-bench: add support for cool off between tests --delay

3bcc4dee

slaren

slaren commented on 2024-08-27

max-krasnyansky

threadpool: move process priority setting into the apps (bench and cli)

5d4c0a13

max-krasnyansky

max-krasnyansky force pushed 1 year ago

max-krasnyansky

threadpool: move all pause/resume logic into ggml

e3c22020

max-krasnyansky

max-krasnyansky force pushed to e3c22020 1 year ago

max-krasnyansky

threadpool: futher api cleanup and prep for future refactoring

c6328bc0

max-krasnyansky

max-krasnyansky

threadpool: minor indent fixes

bead7d47

max-krasnyansky

threadpool: improve setprioty error message

8e8f8ce4

slaren

slaren approved these changes on 2024-08-29

max-krasnyansky

Update examples/llama-bench/llama-bench.cpp

c6c27b14

max-krasnyansky

threadpool: fix indent in set_threadpool call

b97bd67e

max-krasnyansky

use int32_t for n_thread type in public llama.cpp API

cae35b9f

max-krasnyansky

threadpool: use _new and _free instead of _create and _release

c49d6340

max-krasnyansky

fix two more public APIs to use int32_t for n_threads

3b5f7c2a

max-krasnyansky

build: set _GNU_SOURCE for Adroid

52aa6772

max-krasnyansky

slaren

slaren merged 42c76d13 into master 1 year ago

slaren

max-krasnyansky

ggerganov

FranzKafkaYu

fmz

Login to write a write a comment.

Login via GitHub

Reviewers

slaren

slaren

max-krasnyansky

max-krasnyansky

Assignees

No one assigned

Labels

testing examples Review Complexity : Medium server ggml

Milestone

No milestone