DeepSpeed
Add fp16 support of Qwen1.5 models (0.5B to 72B) to DeepSpeed-FastGen
#5219
Merged

Loading