bigscience-workshop/Megatron-DeepSpeed
Pull Requests
Feature/tigerbot (#399 by i4never, closed 2023-10-27 03:16)
Feature/tigerbot (#393 by i4never, closed 2023-07-25 07:14)
Fix/dataloader error (#384 by EastInsure, closed 2023-05-08 12:19)
Adapt to DCU (#368 by hepj987, closed 2023-03-06 04:58)
Bsevalharness (#362 by Muennighoff, closed 2023-01-27 09:13)
Distill megatron - test Draft WIP (#352 by younesbelkada, closed 2022-10-11 09:27)
Distill megatron - WIP draft code (#351 by younesbelkada, closed 2022-09-28 09:38)
[bloom inference scripts] improvements (#345 by stas00, merged 2022-09-13 18:51)
[Bloom inference] further improvements (#344 by stas00, closed 2022-09-07 20:42)
[ds-inference bloom] tweaks (#340 by stas00, merged 2022-09-07 17:43)
Followup PR for adding generation-server (#339 by mayank31398, merged 2022-09-11 17:48)
disable CI (#332 by stas00, merged 2022-08-18 16:17)
merge main (#331 by Muennighoff, merged 2022-08-17 07:03)
Add generation server scripts using HF accelerate and DS-inference (#328 by mayank31398, merged 2022-09-01 17:06)
Add option to normalize loss per target (#326 by Muennighoff, merged 2022-11-03 17:38)
Add generation server scripts (#325 by mayank31398, closed 2022-08-10 19:28)
add args_deepspeed_gpt.sh (#322 by xyn1201, closed 2022-08-02 03:53)
Generation server using HF accelerate and DS inference (#321 by mayank31398, closed 2022-08-08 17:31)
add OnDevice and remove zero-inference (#316 by jeffra, merged 2022-07-19 01:22)
BLOOM Inference via DeepSpeed-Inference, Accelerate and DeepSpeed-ZeRO (#308 by stas00, merged 2022-08-10 19:28)
Add bias a weight we need to sync as well across TP (#307 by thomasw21, merged 2022-07-07 16:35)
Fix causal attention mask (#306 by thomasw21, merged 2022-07-07 19:39)
Combine Specs (#304 by Muennighoff, merged 2022-07-07 08:29)
CI fixes (#302 by stas00, merged 2022-07-04 23:25)
[MTF] Add `weighted-split-paths` support (#299 by thomasw21, merged 2022-07-06 10:20)
MTF optimize dataloading (#298 by thomasw21, merged 2022-07-04 08:41)
resolve conflict (#297 by lintangsutawika, closed 2022-07-02 04:16)
Add P3 preparation script (#296 by Muennighoff, closed 2022-07-04 06:50)
MTF train script (#295 by thomasw21, merged 2022-07-05 14:03)
Merge MLM too fast 2 (#294 by thomasw21, merged 2022-06-30 15:02)