llama.cpp
1ec7ba0c - opencl: add q4_1 MoE for Adreno (#22856)

Commit
48 days ago
opencl: add q4_1 MoE for Adreno (#22856)

* Q4_1 MoE CLC pass sanity check

* remove unnecessary code

* opencl: remove unnecessary asserts and reformat

* opencl: fix supports_op for q4_1 moe

* q4_1 moe is supported by Adreno with certain shapes

---------

Co-authored-by: Li He <lih@qti.qualcomm.com>