DeepSpeed
e6fcc226 - fix fp16 Qwen2 series model to DeepSpeed-FastGen (#6028)

Commit
1 year ago
fix fp16 Qwen2 series model to DeepSpeed-FastGen (#6028)

Based on PR #5403 (Qwen1.5-MOE) and #5219 (Qwen1.5), this adds support for the Qwen2 series models, including the 0.5B, 1.5B, 7B, 57B-A14B, and 72B models.

Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>