llama.cpp
945bf106 - metal : add MoE kernel specialization for ne20=5 (#18667)

Commit
6 days ago
metal : add MoE kernel specialization for ne20=5 (#18667)

Add template specialization for kernel_mul_mm_id_map0 with ne20=5 to support models using 5 active experts (e.g., VAETKI).
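
For readers unfamiliar with the pattern, here is a minimal host-side C++ sketch of what it means to specialize a routine on the number of active experts. It is not the Metal kernel from this commit: the names build_expert_map and build_expert_map_dispatch, the flat ids layout, and the instantiated set {4, 5, 6} are assumptions made for illustration only. The point it shows is that a template is only usable for the values it was explicitly instantiated with, so a model with 5 active experts needs its own instantiation.

```cpp
#include <cstdint>
#include <stdexcept>
#include <vector>

// Hypothetical analogue of an expert-routing map (illustration only).
// NE20 is the number of active experts per token, fixed at compile time
// so the inner loop has a constant trip count.
// ids holds NE20 selected expert indices per token, stored contiguously.
// Returns, for each expert, the flat (token * NE20 + slot) indices routed to it.
template <int NE20>
std::vector<std::vector<int32_t>> build_expert_map(const std::vector<int32_t> & ids, int n_expert) {
    std::vector<std::vector<int32_t>> map(n_expert);
    const int n_tokens = static_cast<int>(ids.size()) / NE20;
    for (int t = 0; t < n_tokens; ++t) {
        for (int s = 0; s < NE20; ++s) {
            const int32_t flat = t * NE20 + s;
            map[ids[flat]].push_back(flat);
        }
    }
    return map;
}

// Explicit instantiations: only these values of NE20 are compiled in.
// Supporting a 5-expert model means adding the <5> line (assumed set).
template std::vector<std::vector<int32_t>> build_expert_map<4>(const std::vector<int32_t> &, int);
template std::vector<std::vector<int32_t>> build_expert_map<5>(const std::vector<int32_t> &, int);
template std::vector<std::vector<int32_t>> build_expert_map<6>(const std::vector<int32_t> &, int);

// Runtime dispatch onto the compile-time variants; unknown values are rejected.
std::vector<std::vector<int32_t>> build_expert_map_dispatch(int ne20, const std::vector<int32_t> & ids, int n_expert) {
    switch (ne20) {
        case 4: return build_expert_map<4>(ids, n_expert);
        case 5: return build_expert_map<5>(ids, n_expert);
        case 6: return build_expert_map<6>(ids, n_expert);
        default: throw std::invalid_argument("no specialization for this ne20");
    }
}
```

Without the case for 5, a call with ne20=5 would fall through to the error path, which is the kind of gap this commit closes on the Metal side.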