Add support of Falcon models (7b, 40b, 180b) to DeepSpeed-FastGen #4790
Add support of Falcon 7b model
e89633e3
Add TP support for Falcon 7b model
ed658aa7
Add support of Falcon new decoder arch (40b and 180b models)
26d5c80b
arashb
force pushed
from
8766da47
to
26d5c80b
2 years ago
Fix kv rotary kernels of Falcon 180b model for 8-way sharding
f4a340a7
Merge branch 'master' into arashb/falcon
ee7fc0f5
awan-10
approved these changes
on 2023-12-12
Update DeepSpeed-FastGen blog to include Falcon support
5557e73c
mrwyattii
approved these changes
on 2023-12-12
mrwyattii
merged
a7900bcc
into master 2 years ago
mrwyattii
deleted the arashb/falcon branch 2 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub