llama.cpp
IBM Granite MoE Architecture
#9438
Merged

Loading