Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
bigscience-workshop/Megatron-DeepSpeed
Pull Requests
Commits
mtf-multival
LS/alibi
LS/doc
Lucile/add-eval-only-arg
Lucile/delete_unnecessary_brackets
Lucile/useless-parenthesis
add-valid-data
bitfit
bloom-ds-inference-repos2
bloom-inference-meta
bnb-resume-2x
bseval_harness
cc-concurrency
chpt-conversion-fix
ckptavg
cluster_benchmark
consumed_samples_per_valid_dataset
cyclic_valid_dataloaders
debug_with_new_dataset
dependabot/pip/black-24.3.0
ds_ckpt_reshape-with-layer-norm-auto-sync
ds-version-check
fix-sample-ids
fp32-checkpoint-extraction
gpu-direct
hadyelsahar/main
launch-debug
license
log-grad-norm
lumi_eval
lumi_mtf
main
master
megatron-2.4-ds-pipe
mtf_p3
mtf-multival
new-dataset
no-shuffling-option
nozero_reshape
olruwase/ds_ckpt_reshape
olruwase/sync_layer_norms
prefixbseval
preprocess_from_HF_dataset
rm-duplicate-param-count
samson/spm
scratchpad
self_attention_stable_corby
skip-broken-tests
sync4
t0loading
test-conversion
thomas/add_shared_t5
thomas/evaluate_gpt_on_prefix_lm_loss
thomas/evaluate_gpt_speed_if_we_pass_attention_mask
thomas/fix_installation
thomas/fix_layer_norm
thomas/improve_test_to_test_custom_kernel
thomas/mlm_train_script
thomas/opt
thomas/test_different_layer_norm
tp-ln-debug
tr1-13B
tr8-104B
train-no-eval-restart
training_flos_rebase
training_flos
universal_ckpt_info
universal_to_fp32_checkpoint
val_args
Set iteration to args by default
Muennighoff
committed
3 years ago
477cda6d
Add multiple evaluation compat
Muennighoff
committed
3 years ago
6c1018f6
Merge branch 't0loading' into lossseq
Muennighoff
committed
3 years ago
456327c1
Move view
Muennighoff
committed
3 years ago
d9a91feb
Reshape loss mask
Muennighoff
committed
3 years ago
549f4993
Move norm to batch pipe
Muennighoff
committed
3 years ago
a6b26240
Loss mask is already float
Muennighoff
committed
3 years ago
2e7554d7
Clarify loss on targets & remove kwarg
Muennighoff
committed
3 years ago
fce1a98e
Add reset-progress key
Muennighoff
committed
3 years ago
26997216
Merge branch 'main' into t0loading
Muennighoff
committed
3 years ago
0a324592
Add norm_target_loss arg
Muennighoff
committed
3 years ago
7bc1dd20
BLOOM Inference via DeepSpeed-Inference, Accelerate and DeepSpeed-ZeRO (#308)
stas00
committed
3 years ago
Verified
3932c749
Simplify division
Muennighoff
committed
3 years ago
900c8356
Reuse variable
Muennighoff
committed
3 years ago
616cfe86
Efficient loss normalization
Muennighoff
committed
3 years ago
992446c8
Tmp lossseq
Muennighoff
committed
3 years ago
462efd99
Add bos option
Muennighoff
committed
3 years ago
dc8d0abb
Reshape deepspeed checkpoint (#239)
tjruwase
committed
3 years ago
Verified
0f23a729
Add prefixlm arg
Muennighoff
committed
3 years ago
b15ca2d5
Allow not using torch distributed
Muennighoff
committed
3 years ago
b62dcafc
Avoid loading module when not loading optim
Muennighoff
committed
3 years ago
cb0313ba
Remove helper scripts
Muennighoff
committed
3 years ago
ca740f1e
Remove unnec imports
Muennighoff
committed
3 years ago
2dfe5d11
JSON helper scripts
Muennighoff
committed
3 years ago
a55d2fb5
Merge remote
Muennighoff
committed
3 years ago
fb8ecb8c
Add helpers & set is_causal to true
Muennighoff
committed
3 years ago
89460c0a
Update tools/preprocess_data.py
Muennighoff
committed
3 years ago
Verified
63daa46f
Add prepend-space arg
Muennighoff
committed
3 years ago
0fcb19c1
Swap decoder_is_inputs & segment_ids
Muennighoff
committed
3 years ago
abdd7030
Enable loading ckpt for t0 finetuning
Muennighoff
committed
3 years ago
90b8f46d
Older