Megatron-DeepSpeed
Add generation server scripts using HF accelerate and DS-inference
#328
Merged


mayank31398 merged 33 commits into main from add-generation-server
stas00 commented on 2022-08-11
first step towards making libs
47f3fc0c
HF accelerate model
435af43f
refactor accelerate
d1676fcc
mayank31398 force pushed from 5bc3d845 to d1676fcc 3 years ago
refactor DS inference
eef490c9
refactor DS ZeRO
25d0c704
make inference library
7be14108
cli
5c31d9a3
server
29059555
request
12c4cf7b
remove MaxTokensError
46ade324
fix batch size error with DS inference server
c46d9576
type fix
f3dac05a
add latency
44256140
add latency
c97d6ea6
add min_length to default kwargs
f3385f2b
str kwargs
8f25200b
mayank31398 force pushed from 792cda07 to 8f25200b 3 years ago
str kwargs
b11bb7fa
stas00 commented on 2022-08-17
fix comma
99dedb03
add old scripts back
497f00ef
move scripts
aa8c08c1
drop data
25b5d851
minor changes + add README
92017708
update README
649d7f83
Merge branch 'main' into add-generation-server
aa9ebea3
drop nccl
997a5fa2
stas00 commented on 2022-08-23
stas00 commented on 2022-08-24
stas00 commented on 2022-08-24
stas00 requested changes on 2022-08-24
fix
85d9fcb8
default values
11d50f18
resolve issues
403424bd
handle keyboard interrupt
493b2ee0
remove caching
c84d9b77
use snapshot_download
81d14692
make server class
a85b4886
fix snapshot download
43a844cb
stas00 approved these changes on 2022-09-01
mayank31398 merged f9402d02 into main 3 years ago
mayank31398 deleted the add-generation-server branch 3 years ago