llama.cpp
e5ca3937 - llama : do not cap thread count when MoE on CPU (#5419)

Commit
1 year ago
llama : do not cap thread count when MoE on CPU (#5419)

* Not capping thread count when MoE inference is running on CPU
* Whitespace
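
The commit message does not include the diff, so the following is only a minimal sketch of the kind of check it describes, not the actual llama.cpp code. It assumes that the thread count was previously capped for large batches when a CPU BLAS backend handles the matrix multiplications, and that MoE models are now exempt from that cap. The struct, the function name, the batch threshold of 32, and the cap of 4 are all hypothetical values chosen for illustration.

```cpp
#include <algorithm>

// Hypothetical description of one decode call; these names are
// illustrative and are not taken from the llama.cpp API.
struct DecodeInfo {
    int  n_tokens;  // number of tokens in the current batch
    int  n_expert;  // number of experts; 0 means a dense (non-MoE) model
    bool cpu_blas;  // true if a CPU BLAS backend is in use (assumption)
};

// Returns the number of threads to use for this decode call.
// Assumed policy: cap threads for large batches on dense models when CPU
// BLAS is active, but never cap when the model is MoE.
int pick_thread_count(const DecodeInfo & info, int requested_threads) {
    const int blas_batch_threshold = 32; // assumed threshold, not from the source
    const int blas_thread_cap      = 4;  // assumed cap, not from the source

    const bool is_moe = info.n_expert > 0;

    if (!is_moe && info.cpu_blas && info.n_tokens >= blas_batch_threshold) {
        // Dense model, large batch, CPU BLAS: apply the cap.
        return std::min(blas_thread_cap, requested_threads);
    }

    // MoE on CPU (or any other case): use the requested thread count as-is.
    return requested_threads;
}
```

Under these assumptions, a dense model with a 64-token batch and 16 requested threads would be limited to 4 threads, while an MoE model with the same batch would keep all 16.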