llama.cpp
806d397c - parallel : try smaller batches when the KV cache is fragmented

Commit
2 years ago
parallel : try smaller batches when the KV cache is fragmented
Author
Parents
Loading