llama.cpp
ggml-cpu: FA split across kv for faster TG
#19209
Merged

Commits
  • ggml-cpu: split across kv for faster TG
    am17an committed 35 days ago
  • simplify sinks application
    am17an committed 35 days ago
  • add ref impl
    am17an committed 35 days ago