llama.cpp
update: support Qwen2-57B-A14B
#7835
Merged

Commits
  • update: convert-hf-to-gguf.py to support Qwen2-57B-A14B
    legraphista committed 1 year ago
  • fix: QWEN2MOE support for expert_feed_forward_length
    legraphista committed 1 year ago
  • update: convert-hf-to-gguf.py cleanup for Qwen2MoeForCausalLM
    legraphista committed 1 year ago
  • fix: QWEN2MOE support for expert_feed_forward_length
    legraphista committed 1 year ago
Loading