Add fp16 support of Qwen1.5 models (0.5B to 72B) to DeepSpeed-FastGen #5219
Add fp16 support of Qwen1.5 models (0.5B to 72B) to DeepSpeed-FastGen
388fee35
ZonePG
force pushed
from
68206d8f
to
388fee35
1 year ago
mrwyattii
approved these changes
on 2024-03-02
Merge branch 'master' into master
7f6f0636
mrwyattii
merged
bcc617a0
into master 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub