llama.cpp
c0389dba - CANN: Disable acl_graph for prefill stage (#15933)

Commit
94 days ago
CANN: Disable acl_graph for prefill stage (#15933) Since the prefill length is not fixed, graphs constructed for the prefill stage cannot be reused. For this reason, ACL graph execution is disabled by default during prefill.
Author
Parents
Loading