llama.cpp
Add AfmoeForCausalLM support
#16477
Merged

Add AfmoeForCausalLM support #16477

CISC merged 10 commits into ggml-org:master from bartowski1182:master
bartowski1182
github-actions github-actions added python
bartowski1182 bartowski1182 force pushed 38 days ago
github-actions github-actions added model
bartowski1182 Add AFMOE model support
3fd69c59
bartowski1182 bartowski1182 force pushed to 3fd69c59 37 days ago
bartowski1182 Update to vocab
93a2fb46
bartowski1182 Add model sizing
7c5d7184
bartowski1182 bartowski1182 marked this pull request as ready for review 30 days ago
bartowski1182 bartowski1182 requested a review from CISC CISC 30 days ago
bartowski1182 bartowski1182 requested a review from ggerganov ggerganov 30 days ago
bartowski1182 Undo Rope change for ARCEE model
13aaafe6
CISC
CISC commented on 2025-11-13
bartowski1182 Address review comments
34dc2a37
bartowski1182
bartowski1182 commented on 2025-11-13
ngxson
ngxson commented on 2025-11-13
bartowski1182 Update modeling code is_sliding -> use_rope, replace hard-coded logic
1b9558f8
bartowski1182 Fix AFMOE tokenizer
e41a5bd4
CISC
CISC commented on 2025-11-13
bartowski1182
bartowski1182 commented on 2025-11-13
CISC
CISC commented on 2025-11-13
CISC
CISC commented on 2025-11-13
CISC
CISC commented on 2025-11-13
bartowski1182 Update convert_hf_to_gguf.py
e27c3f1d
bartowski1182 Update convert_hf_to_gguf.py
684aeada
bartowski1182 Update AFMoE tokenizer class identification to be more unique
ddddf8d7
CISC
CISC approved these changes on 2025-11-14
ngxson
ngxson approved these changes on 2025-11-14
CISC
CISC CISC merged e1fcf8b0 into master 29 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone