bigscience-workshop/Megatron-DeepSpeed
Commits
master
fix bug when restarting with no eval in round 1
stas00
committed
4 years ago
318ef29e
Merge branch 'github-pr' into 'main'
jaredcasper
committed
4 years ago
90e0a0dd
Merge branch 'patch-1' of https://github.com/stas00/Megatron-LM into github-pr
jaredcasper
committed
4 years ago
4a35d50a
Merge branch 'typo-fix' of https://github.com/devrimcavusoglu/Megatron-LM into github-pr
jaredcasper
committed
4 years ago
e1318f0e
Merge branch 't5' of https://github.com/stas00/Megatron-LM into github-pr
jaredcasper
committed
4 years ago
7898c9a4
Merge branch 'main_retriver_merge_dpr' into 'main'
jaredcasper
committed
4 years ago
82b69e86
updated readme
mpatwary
committed
4 years ago
4c92ca82
updated readme
mpatwary
committed
4 years ago
32da2e78
updated readme
mpatwary
committed
4 years ago
baf2e2a4
updated readme
mpatwary
committed
4 years ago
9d350c9c
Merge branch 't5_scripts' into 'main'
jaredcasper
committed
4 years ago
2be1e510
Merge branch 'main_retriver_merge_dpr' into 'main'
jaredcasper
committed
4 years ago
598d7ee2
Merge branch 'main_retriver_merge_dpr' of ssh://gitlab-master.nvidia.com:12051/ADLR/megatron-lm into main_retriver_merge_dpr
mpatwary
committed
4 years ago
98113c69
addressed comments
mpatwary
committed
4 years ago
28450473
Clean up README.md a bit
jaredcasper
committed
4 years ago
473127f9
Adding readme
mpatwary
committed
4 years ago
c45109ed
Adding readme
mpatwary
committed
4 years ago
e287bf00
Adding readme
mpatwary
committed
4 years ago
293554aa
Adding readme
mpatwary
committed
4 years ago
8661ca26
Adding readme
mpatwary
committed
4 years ago
bab5cc4e
Adding readme
mpatwary
committed
4 years ago
1095d7e6
Adding readme
mpatwary
committed
4 years ago
d562d7b5
Adding readme
mpatwary
committed
4 years ago
a983cab3
fixed the evaluation hangs
mpatwary
committed
4 years ago
e46f3260
fixed the tensor size miss-mass issue
mpatwary
committed
4 years ago
ebfbfcec
resolved hang issue
mpatwary
committed
4 years ago
04c79f30
Update T5 scripts
deepakn94
committed
4 years ago
3dadd16d
Merge branch 'main' into main_retriver_merge_dpr
mpatwary
committed
4 years ago
84eb016c
updating script
mpatwary
committed
4 years ago
c7c65bbb
Merge branch 'main_retriver_merge_dpr' into 'main'
jaredcasper
committed
4 years ago
83c4d95a