bigscience-workshop/Megatron-DeepSpeed
Commits
train-no-eval-restart
fix bug when restarting with no eval in round 1
stas00
committed
4 years ago
66b4a6c1
Revert "train_samples check fix"
stas00
committed
4 years ago
0e78387d
train_samples check fix
stas00
committed
4 years ago
d13f5c6b
Fixed single node test scripts to use DS
TevenLeScao
committed
4 years ago
Verified
a7a534ae
Fixed single node test scripts to use DS
TevenLeScao
committed
4 years ago
Verified
1556d7e6
adjust the script
stas00
committed
4 years ago
5ab4fad9
add example script for one node
stas00
committed
4 years ago
2048ca57
update docs
stas00
committed
4 years ago
12503bd2
consistency
stas00
committed
4 years ago
8cf27f22
install instructions; consistent indent
stas00
committed
4 years ago
7d880b7c
update to deepspeed usage; rm irrelevant info
stas00
committed
4 years ago
c5e0b248
Tiny gpt training script for local testing
TevenLeScao
committed
4 years ago
Verified
d61ca5c2
HF tokenizers (on the training side) (#5)
TevenLeScao
committed
4 years ago
Verified
0b2f0df4
Update README.md
TevenLeScao
committed
4 years ago
Verified
ca4ea155
Update README.md
TevenLeScao
committed
4 years ago
Verified
a0ab8f48
Create README for preprocessing
TevenLeScao
committed
4 years ago
Verified
ebb6a078
Hf tok pipeline (#2)
sbmaruf
committed
4 years ago
Verified
b1a30de8
add the explanation about the forking problems.
stas00
committed
4 years ago
Verified
5502865b
explain what this repo is about.
stas00
committed
4 years ago
Verified
fb1a120f
Merge pull request #1 from microsoft/megatron-2.4-ds-pipe
jeffra
committed
4 years ago
Verified
b56b50bb
Merge branch 'megatron-2.4-ds-pipe' of github.com:microsoft/Megatron-DeepSpeed into megatron-2.4-ds-pipe
Shaden Smith
committed
4 years ago
346b89a6
DS checkpointing, ZeRO incoming
Shaden Smith
committed
4 years ago
311e56c0
formatting
jeffra
committed
4 years ago
b737bb82
formatting
jeffra
committed
4 years ago
b1c15719
add git info for megatron + ds_report
jeffra
committed
4 years ago
5ef22584
Merge pull request #13 from jeffra/olruwase/megatron-2.4_dse
tjruwase
committed
4 years ago
Verified
846415c9
Remove dead code
tjruwase
committed
4 years ago
9dd7d9eb
Fix newline
tjruwase
committed
4 years ago
c88c876e
Running with ZeRO3
tjruwase
committed
4 years ago
745d74c4
remove debug
Shaden Smith
committed
4 years ago
ae6d7977