llama 70b model fusion and shardding #18175
add shardding support for llama model and update convert/benchmark sc…
c958bc6e
fix bug in llama input
dd41176c
lint style fix
92066936
take comments
b30ff30b
handle export device separately for different model
c4649e8a
lint fix
f393cb2a
Merge branch 'microsoft:main' into frdong/shard-llama
59a27e2e
take comments
fdd4c5fd
take comments
79e185dc
fix comments
0093725a
sync and merge with latest main
5ce04d11
tianleiwu
dismissed these changes
on 2023-11-02
fix benchmark_all and change dist_settings mpi is only required for m…
dcddb283
frank-dong-ms
dismissed their stale review
via dcddb283
2 years ago
fix lint error
aa8ffb68
fix readme
4c567ca0
remove extra line
31c011a1
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub