llama.cpp
945bf106
- metal : add MoE kernel specialization for ne20=5 (#18667)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
6 days ago
metal : add MoE kernel specialization for ne20=5 (#18667) Add template specialization for kernel_mul_mm_id_map0 with ne20=5 to support models using 5 active experts (e.g., VAETKI).
References
#18667 - metal : add MoE kernel specialization for ne20=5
Author
dororodoroddo
Parents
64848deb
Loading