Add Qwen2Moe GGUF loading support #33264
update gguf doc, config and tensor mapping
af2bc3bb
add qwen2moe architecture support, GGUFQwen2MoeConverter and q4 unit …
6579d576
apply code style fixes
dd6a651d
reformat files
b49e3b2b
SunMarc
approved these changes
on 2024-09-05
assign GGUFQwen2Converter to qwen2_moe
1e9bf7e3
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub