llama.cpp
Not capping thread count when MoE inference is running on CPU
#5419
Merged

ggerganov merged 2 commits into master from moe-cpu-thread-cap
ptsochantaris Not capping thread count when MoE inference is running on CPU
f8dc954e
ptsochantaris Whitespace
d5a6e865
slaren approved these changes on 2024-02-08
slaren requested a review from ggerganov 1 year ago
kalomaze
kalomaze
ggerganov approved these changes on 2024-02-09
ggerganov merged e5ca3937 into master 1 year ago
ptsochantaris deleted the moe-cpu-thread-cap branch 1 year ago

