llama.cpp
Model: Granite MoE shared
#13269
Merged

Model: Granite MoE shared #13269

CISC merged 11 commits into ggml-org:master from gabe-l-hart:GraniteMoEShared
gabe-l-hart
github-actions github-actions added python
ngxson
ngxson commented on 2025-05-02
ngxson
ngxson commented on 2025-05-02
gabe-l-hart feat: Add GGUF conversion for granitemoeshared
a6529ccd
gabe-l-hart feat: hparam and arch plumbing for granitemoeshared
731c5fc4
gabe-l-hart fix: Split MoE fused tensors for shared experts in conversion
c5d897ed
gabe-l-hart feat: First WIP cut at model arch in cpp
054059ea
gabe-l-hart fix: Cleaner (maybe more correct?) splitting for gate/up
5a98b485
gabe-l-hart fix: Fix the input to the shared experts
9763c9a2
gabe-l-hart fix: Avoid architecture-specific checks for Granite MoE Shared
52d2ed6e
gabe-l-hart gabe-l-hart force pushed from 97de56d8 to 52d2ed6e 234 days ago
gabe-l-hart refactor: Split granite architectures out of llm_build_llama
44469949
gabe-l-hart
gabe-l-hart fix: Fix compiler warning about uninitialized inp_pos
3d792146
gabe-l-hart gabe-l-hart force pushed to 3d792146 234 days ago
ggerganov
ggerganov approved these changes on 2025-05-12
ggerganov
zunigasllc
gabe-l-hart
gabe-l-hart fix: Consoladate GraniteMoEShared into GraniteMoE for conversion
2aed91c3
gabe-l-hart fix: Consolidate GraniteMoEShared into GraniteMoE on the c++ side
33008e8c
gabe-l-hart
CISC CISC merged d590cd4c into master 231 days ago
gabe-l-hart gabe-l-hart deleted the GraniteMoEShared branch 231 days ago
ggerganov
ggerganov commented on 2025-05-14

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone