llama.cpp
4b2a4778 - arg: allow -kvu flag for llama-perplexity (#18117)

Commit
9 hours ago
arg: allow -kvu flag for llama-perplexity (#18117)

The -kvu (--kv-unified) flag is required for the hellaswag and winogrande benchmarks, which use coupled sequences. Without a unified KV cache, these benchmarks fail with:

    split_equal: sequential split is not supported when there are coupled sequences in the input batch (you may need to use the -kvu flag)

This change adds LLAMA_EXAMPLE_PERPLEXITY to the allowed examples for the -kvu argument, enabling its use with llama-perplexity.
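The idea of a per-tool allow-list for an argument can be sketched as below. This is a simplified illustration, not the actual llama.cpp code: the types `example_sketch` and `arg_sketch`, the helper `make_kvu_arg`, and the presence of a server entry in the list are all assumptions made for the example. Only the flag names and the notion of adding the perplexity example to the allowed set come from the commit message.

```cpp
#include <cassert>
#include <initializer_list>
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-in for the enum of tools ("examples") that share
// one argument parser. The real enum in llama.cpp has different members.
enum example_sketch {
    EXAMPLE_SERVER,      // assumed to already be allowed for -kvu
    EXAMPLE_PERPLEXITY,  // the tool this commit adds to the allow-list
    EXAMPLE_OTHER,       // any tool that is not on the allow-list
};

// Hypothetical argument descriptor: flag spellings plus the set of
// tools that are allowed to accept the flag.
struct arg_sketch {
    std::vector<std::string> names;
    std::set<example_sketch> examples;

    // Replace the allow-list; returns *this so calls can be chained.
    arg_sketch & set_examples(std::initializer_list<example_sketch> ex) {
        examples = ex;
        return *this;
    }

    // True if the given tool may accept this argument.
    bool is_allowed_in(example_sketch ex) const {
        return examples.count(ex) > 0;
    }
};

// Build the -kvu descriptor. with_perplexity = false models the state
// before the change, true models the state after it.
inline arg_sketch make_kvu_arg(bool with_perplexity) {
    arg_sketch a{{"-kvu", "--kv-unified"}, {}};
    if (with_perplexity) {
        a.set_examples({EXAMPLE_SERVER, EXAMPLE_PERPLEXITY});
    } else {
        a.set_examples({EXAMPLE_SERVER});
    }
    return a;
}
```

In this sketch, a parser would consult `is_allowed_in` before accepting the flag for a given tool, so extending the list passed to `set_examples` is the only change needed to expose an existing flag to another tool.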