mosaicml/llm-foundry
Commits
comment-ghcr
rm tag
Vincent Chen
committed
291 days ago
d447c1cc
comment
Vincent Chen
committed
291 days ago
e623bfe5
comment
Vincent Chen
committed
291 days ago
31722458
comment ghcr
Vincent Chen
committed
291 days ago
efd60068
Bump FA2 to 2.7.4.post1 (#1728)
KuuCi
committed
291 days ago
Verified
f3c6ec20
Bump Transformer v4.49.0 (#1735)
KuuCi
committed
291 days ago
Verified
a0ae0258
Bump composer to 0.29.0 (#1733)
rithwik-db
committed
297 days ago
Verified
c66bc22a
Fix dtype issue in transformers (#1734)
dakinggg
committed
298 days ago
Verified
09e40e84
Bump TE for FA 2.7.1.post1 bump (#1730)
KuuCi
committed
299 days ago
Verified
0dcd86e7
remove deprecated param (#1727)
bigning
committed
306 days ago
Verified
ca7e060e
Bump datasets version (#1724)
dakinggg
committed
310 days ago
Verified
a6a3c569
Update accelerate requirement from <1.2,>=0.25 to >=0.25,<1.4 (#1714)
dependabot[bot]
committed
326 days ago
Verified
e03b23d9
Bump version to 0.18.0.dev (#1717)
milocress
committed
331 days ago
Verified
a02b90d0
Refactor HF checkpointer (#1690)
milocress
committed
332 days ago
Verified
cc0df9f3
Update mcli examples to use 0.16.0 (#1713)
irenedea
committed
344 days ago
Verified
63a733d8
Bump version to 0.17.0.dev0 (#1712)
irenedea
committed
345 days ago
Verified
ee853577
Update mosaicml-streaming to 0.11.0 (#1711)
es94129
committed
347 days ago
Verified
00500ad9
Bump coverage[toml] from 7.6.4 to 7.6.10 (#1702)
dependabot[bot]
committed
353 days ago
Verified
8781b2c9
Update datasets requirement from <3.2,>=2.20.0 to >=2.20.0,<3.3 (#1698)
dependabot[bot]
committed
353 days ago
Verified
24c3ad6c
Add permission error (#1703)
b-chu
committed
354 days ago
Verified
3e3bc5f1
Update pycln (#1704)
b-chu
committed
355 days ago
Verified
0959e9cf
Adding preprocessors for QA and messages datasets (#1700)
ShashankMosaicML
committed
1 year ago
Verified
3269c739
Make loaded peft adapters optionally trainable (#1701)
snarayan21
committed
1 year ago
Verified
5a62fbac
Catch multiple slashes in source dataset into one slash (#1697)
KuuCi
committed
1 year ago
Verified
c494017b
Update datasets requirement from <2.21,>=2.20.0 to >=2.20.0,<3.2 (#1670)
dependabot[bot]
committed
1 year ago
Verified
a27c7200
Update example yamls to use newest foundry version (#1689)
snarayan21
committed
1 year ago
Verified
7b8bf5f6
Fix llama3 example yamls (#1688)
j316chuck
committed
1 year ago
Verified
2b9f6828
Add llama3 ft example yamls (#1686)
j316chuck
committed
1 year ago
Verified
05563e1f
Expose `DistributedSampler` RNG seed argument (#1677)
janEbert
committed
1 year ago
Verified
ff3d9018
Bump Composer to v0.28.0 (#1687)
snarayan21
committed
1 year ago
Verified
f0cf727b