llama.cpp
Not capping thread count when MoE inference is running on CPU
#5419

Merged

Not capping thread count when MoE inference is running on CPU #5419

ggerganov merged 2 commits into master from moe-cpu-thread-cap

Not capping thread count when MoE inference is running on CPU

f8dc954e

Whitespace

d5a6e865

slaren approved these changes on 2024-02-08

slaren requested a review from

ggerganov 1 year ago

ggerganov approved these changes on 2024-02-09

ggerganov merged e5ca3937 into master 1 year ago

ptsochantaris deleted the moe-cpu-thread-cap branch 1 year ago

Reviewers

ggerganov

slaren

Assignees

No one assigned

Labels

None yet

Milestone

No milestone