mosaicml/llm-foundry
Commits
saaketh/generation_benchmarking_2
yo' (snarayan21 committed 1 year ago, ff1c4d5a)
tp (snarayan21 committed 1 year ago, ba18acea)
tp (snarayan21 committed 1 year ago, 3579b137)
notp (snarayan21 committed 1 year ago, 3a857ee5)
4 (snarayan21 committed 1 year ago, 0e3a7c2c)
1 (snarayan21 committed 1 year ago, 25043b85)
8 (snarayan21 committed 1 year ago, 50717345)
layerplan (snarayan21 committed 1 year ago, 24165faf)
layerplan (snarayan21 committed 1 year ago, a7286292)
layerplan (snarayan21 committed 1 year ago, bca27a99)
named params (snarayan21 committed 1 year ago, 43c168eb)
more_steps (snarayan21 committed 1 year ago, 4c37c15d)
more_scripts (snarayan21 committed 1 year ago, c87a0cc0)
yo (snarayan21 committed 1 year ago, d9d73039)
yo (snarayan21 committed 1 year ago, 7fe58bd2)
yo (snarayan21 committed 1 year ago, 880b5ad1)
yo (snarayan21 committed 1 year ago, 8faa6532)
profile (snarayan21 committed 1 year ago, ae7ec785)
eval (snarayan21 committed 1 year ago, 6e3a4f7a)
train (snarayan21 committed 1 year ago, 89e6cb95)
memrep (snarayan21 committed 1 year ago, aec786ff)
memreport (snarayan21 committed 1 year ago, 82b54481)
mem (snarayan21 committed 1 year ago, 987ed4f6)
no (snarayan21 committed 1 year ago, 28aec8b4)
yo (snarayan21 committed 1 year ago, 66382c1a)
yo (snarayan21 committed 1 year ago, f255c5db)
device_bs (snarayan21 committed 1 year ago, ef8adca0)
no (snarayan21 committed 1 year ago, 488eeb6e)
gen (snarayan21 committed 1 year ago, ba34e820)
mask (snarayan21 committed 1 year ago, eb232fbf)