Megatron-DeepSpeed
BLOOM Inference via DeepSpeed-Inference, Accelerate and DeepSpeed-ZeRO
#308
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
58
Changes
View On
GitHub
BLOOM Inference via DeepSpeed-Inference, Accelerate and DeepSpeed-ZeRO
#308
stas00
merged 58 commits into
main
from
bloom-inference
hardcode the dtype depending on the model
efa354dd
stas00
changed the title
hardcode the dtype depending on the model
BLOOM Inference via DeepSpeed-Inference
3 years ago
change the mp based on the world_size
cafc3f5d
remove hardcoded world_size
daeb293b
add bigscience/bigscience-small-testing
7d5f7d46
Merge branch 'bloom-inference' of https://github.com/bigscience-works…
2d3d271b
fixes
1ff0f698
add zero-inference script
56b24ed3
fixes
67aab37c
fix
328ab0cc
working script
f2628b03
renames
195288e5
fixes
3c7b2cb6
fix for offline use
6c5c23ba
add benchmark
6b192274
add benchmark
10cbb2d4
update
494c212e
cleanup
2b67c0d9
update
3853724e
msecs
18967399
cleanup
7c9daaf1
improve
dca2c8f7
fix benchmark, add warmup
85580c0b
update
5ea3dee9
fix; thanks Michael Wyatt
737c6816
clarify
6be0cca4
Merge branch 'bloom-inference' of https://github.com/bigscience-works…
fea3902a
add bloom batch-inference script
fc9b458a
removed the names :-)
7b0edef2
fold the bs functionality from the other script
2120dd2a
fix
78bcbb7e
restore do_sample
e7468cd2
dump generate args
68f5ca6a
fix
1eca7c50
fix
8815fc3d
support any batchsize
034cc6fe
div by bs
155c3c32
mul by bs
73a8b7b7
add cpu_offload; sync scripts
09d74088
wip
695265d4
thomasw21
commented on 2022-07-14
improvements
1a7e891b
fixes
aba4055a
fixes
5e92d552
add accelerate script
39921122
fix
5a7057b0
stas00
changed the title
BLOOM Inference via DeepSpeed-Inference
BLOOM Inference via DeepSpeed-Inference, Accelerate and DeepSpeed-ZeRO
3 years ago
wip
47585312
wip
7550ee06
stats
5153c402
add OnDevice and remove zero-inference (#316)
cb50ea59
wip
a53fcaa5
rework generate + benchmark
72528797
figure out the memory map dynamically
2aa419d4
bug fix
4bd8ca5b
tjruwase
commented on 2022-07-19
fix ds-zero-inference wrt device
b76e5166
bug fix
ecfd5771
update
fd26b9c4
mayank31398
commented on 2022-07-21
update
e2bfe916
mayank31398
commented on 2022-07-23
mayank31398
approved these changes on 2022-07-29
fix
b9a67ea5
Merge remote-tracking branch 'origin/main' into bloom-inference
3862ef0f
stas00
merged
3932c749
into main
3 years ago
stas00
deleted the bloom-inference branch
3 years ago
Login to write a write a comment.
Login via GitHub
Reviewers
mayank31398
tjruwase
thomasw21
tchaton
zcrypt0
Assignees
No one assigned
Labels
None yet
Milestone
No milestone
Login to write a write a comment.
Login via GitHub