llama.cpp
4926419c - ggml: add ggml_can_fuse_subgraph (#16662)

Commit
15 days ago
ggml: add ggml_can_fuse_subgraph (#16662)

* ggml: add ggml_can_fuse_subgraph
* ggml-cuda: use ggml_can_fuse_subgraph for topk-moe
* format
* 1. remove inputs from signature as they are transient nodes
  2. add check for views: view_src should be part of the subgraph
* - combine check into one loop
  - check all view_src parents
  - other minor review comments
* remove redundant if test
* - rename and other minor review comments
* add assert about count < 32
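The commit message names two conditions for fusing a subgraph: a view's view_src must itself be part of the subgraph, and the node count must stay below 32. The sketch below illustrates checks of that kind on a toy graph. It is not the ggml implementation: ToyNode, toy_can_fuse_subgraph, the explicit outputs list, the rule that intermediate results must not be consumed outside the subgraph, and the 32-bit mask used to motivate the count limit are all assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Toy stand-in for a graph node. NOT ggml's ggml_tensor; hypothetical.
struct ToyNode {
    std::vector<int> srcs;  // indices of the nodes this node reads from
    int view_src = -1;      // index of the node this one is a view of, or -1
};

// Hypothetical fusability check (assumed rules, not the ggml API):
//  - every view inside the subgraph must have its view_src inside it too
//  - every non-output node must have no consumers outside the subgraph
// `sub` lists graph indices of the candidate nodes; `outputs` lists the
// graph indices of nodes whose results must stay visible after fusion.
bool toy_can_fuse_subgraph(const std::vector<ToyNode> & graph,
                           const std::vector<int> & sub,
                           const std::vector<int> & outputs) {
    // Membership of outputs is tracked in a 32-bit mask, so the subgraph
    // must hold fewer than 32 nodes (assumed reason for such a limit).
    assert(sub.size() < 32);

    // pos[g] = position of graph node g inside `sub`, or -1 if absent.
    std::vector<int> pos(graph.size(), -1);
    for (std::size_t i = 0; i < sub.size(); ++i) {
        pos[sub[i]] = static_cast<int>(i);
    }

    std::uint32_t out_mask = 0;
    for (int o : outputs) {
        if (pos[o] < 0) {
            return false;  // an output has to belong to the subgraph
        }
        out_mask |= 1u << pos[o];
    }

    for (std::size_t i = 0; i < sub.size(); ++i) {
        const ToyNode & n = graph[sub[i]];

        // View rule: the viewed node must be fused together with the view.
        if (n.view_src >= 0 && pos[n.view_src] < 0) {
            return false;
        }
        if (out_mask & (1u << i)) {
            continue;  // outputs may be consumed anywhere
        }
        // Intermediate node: reject if any node outside the subgraph reads it.
        for (std::size_t j = 0; j < graph.size(); ++j) {
            if (pos[j] >= 0) {
                continue;
            }
            for (int s : graph[j].srcs) {
                if (s == sub[i]) {
                    return false;
                }
            }
        }
    }
    return true;
}
```

In this toy model, a chain 0 -> 1 -> 2 -> 3 with subgraph {1, 2, 3} and output {3} is fusable, while adding an outside node that also reads node 1 makes the check fail unless node 1 is declared an output as well.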