Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
mosaicml/llm-foundry
Pull Requests
Commits
boweny/fsdp2/playground
3.10-to-string
CPT_Offset
abhay/add_warmrestarts
abhay/better_logging
abhay/mm_dev
add_bbh
add_math
add_trtllm_dev
add_trtllm_wrapper
add-mflow-logger
aditi-generate-evals
angel/catch-file-not-found
angel/log-data-for-run-analytics
anna/dbrxeval
anna/eval-loader
anna/patch
anna/sp-import
asfandyar/fmapi
batch_code_eval_withtasks
bcui/tokenizer_breaking_checkpointing
bigning-patch-1
bilal-activation-monitor
boweny/composer-0.31
boweny/fsdp2/playground
boweny/param-init-dtensor
boweny/playground/onboarding
boweny/torch-2.7
boweny/use-foundry-image
brier_score
bruce/fix-wikiqa
bruce/vllm-eval
bruce/vllm-eval-v2
bruce/8k
bump_composer_version_0.26.0
bump_dev_version
bump_foundry_0.15.0_yamls
bump_mcli_examples_v0.13.0
bump_mcli_examples_v0.14.0
bump_mcli_examples_v0.16.0
bump_version_v0.7.0
bump_version_v0.9.0.dev0
bump_version_v0.18.0.dev0
bump_version_0.12.0.dev0
bump_version_0.14.0.dev0
bump_version_0.15.0.dev0
bump_version_0.16.0.dev0
bump_version_0.17.0.dev0
bump_version_0.17.0
bump-0.20.0.dev0
bump-composer-0.30.0
bump-example-yamls-0.19.0
bump-fa2-2.7.4.post1
bump-streaming-0.12.0
bump-te-1.3
byod/data_validation_fix
byod/data_validation
cache-docker-release-builds
catch-cluster-perm
catch-delta-table-not-found
catch-grcp-hardware
chronos-test-vincen
chuck/add_foundry_te_again
chuck/add_foundry_te_docker_no_deps
chuck/add_foundry_te_docker
chuck/add_foundry_te_torch_2_1
chuck/add_hf_ckpt_fix
chuck/add_llama3_yaml
chuck/add_te_together
chuck/add_te
chuck/add_torch_2_4_nightly_image
chuck/bump_mosaicml_version_again
chuck/bump_mosaicml_version
chuck/bump_torch_version
chuck/bump_torch_2_4
chuck/bump-torch-2-5
chuck/debug_keys
chuck/fix_eval_with_drop_last_flag
chuck/fix_hf_task_format
chuck/fix_llama3_yaml
chuck/fix_llm_foundry
chuck/fix_te_docker_shard
chuck/fix_te_eval_with_drop_last_flag
chuck/gpu-build-te
chuck/gpu-build-te-win
chuck/log_mpt_config
chuck/replace_hf_causal_lm
chuck/revert_te
chuck/rl_bpt
chuck/rl
chuck/save_te_onnx_export_main
chuck/speedup_add_foundry_te_docker_no_dep
chuck/te-install
chuck/test_callback_load
chuck/test_ckpt_fix
chuck/test_one_more_te
chuck/test_te_shard_weight
chuck/torch_2_5_bump
chuck/update_bpt
chuck/update_setup_dockerfile
clear-entrypoint-cli
cli99/eval
cli99/vllm-eval-v2-lctx
codestar12-patch-1
comment-ghcr
composer-bump
convert_examples_ckpt-cli
dataforge/enable_all_cpu
davis/update-lion8b
debug_f1_score
debug_gauntlet_v0.3
debug_hang
debug_resumeoom
debug_triton
dependabot/pip/datasets-gte-3.3.2-and-lt-4.4
dependabot/pip/flash-attn-2.8.3
dependabot/pip/huggingface-hub-hf_xet--gte-0.30.0-and-lt-0.37
dependabot/pip/onnxruntime-1.23.2
dependabot/pip/transformers-gte-v4.51.0-and-lt-4.58
deprecate-fsdp-config
eitan-patch-json
embedding-infer-step-size
enforce-compute-cluster-version
error
ethantang-db/composer_main
ethantang-db/composer_32_1_fix
ethantang-db/dle_package_upgrades
ethantang-db/sdpa
ethantang-db/slowdown_debug
ethantang-db/tokenizers_optional
ethantang-db/upgrade_transformers
ethantang-db/v0.23.0_dev
f1_score
fastrms
finetuning
fix-cl
fix-fp8-act-ckpt-flag
gate-megablocks
generation_kwargs_fix
habana_alpha
hanlin/dbrx_updates
hf_to_ft_convert_fix
hfcheckpointer-optional-generation-config
human_eval_pack
initialization_postlayernorms_residuals
irene/world-size-test
james/unsafe-types
jane/add-exceptions
jane/add-ft-error-handling
jane/download-hf-to-uc
jane/fail-run
jane/fix-error
jane/mlflow-upload
jane/re-on-timeout
jane/remove-rich
jane/test-exceptions
jerry/experimental
jerry/mlflow-objectstore-exp
jerry/model-cache
jerry/oras
jfrankle-061523
jfrankle-fix
jimmy/data-split
josejg/harmony-fix
josejg/harmony-fix-debug
josejg/qknorm
josejg/tunes
josejg-envlogger-name
josejg-register-nanmonitor
kmmlu
linden/tp
llama_cot_fix
lupesko-patch-1
main
manual-cl
matt/registration-error
matt/split-mds-script
matt/split-mds-script-new
mcli-version-bump
milo/catch-bad-split-regex
milo/catch-more-grpc
milo/catch-more-unknown-example-types
milo/catch-more-unknown-example-types-1
milo/data-prep
milo/fix-retry-crash-loop
milo/fsdp-2-playground
milo/harbor-checkpointer
milo/uncomment-gpu-tests
milo/update-readme-for-variables
milo/update-version-names
milo/wrap-spark-errors-ii
model_gauntlet_v0.1
mpt-7b-test
mpt-quantization-eval
mvpatel2000/fuse-chunk
mvpatel2000/mla
mvpatel2000/relu-attn
mvpatel2000/relu-squared-attn
mvpatel2000/sync
nancy/combined
nancy/register-model
nancyhung/update-error-message
nicholas/exp-debug-issues
nicholas/finetuning-exp
nicholas/uc-upload
nik/ds-llama
nik/ft-model-handler
open-source-embeddings
openai_compatible_gauntlet
output_eval_logging
perms-select-table
pipeline-default-none
quantization-benchmarking
rag_generation_tasks
rag_plus_f1
refactor_Qa
registration
release/v0.2.0
release/v0.3.0
release/v0.4.0
release/v0.5.0
release/v0.6.0
release/v0.7.0
release/v0.8.0
release/v0.9.0
release/v0.9.1
release/v0.10.0
release/v0.11.0
release/v0.12.0
release/v0.13.0
release/v0.13.1
release/v0.14.0
release/v0.14.1
release/v0.14.2
release/v0.14.3
release/v0.14.4
release/v0.14.5
release/v0.15.0
release/v0.15.1
release/v0.16.0
release/v0.17.0
release/v0.17.1
release/v0.18.0
release/v0.19.0
release/v0.20.0
release/v0.21.0
release/v0.22.0
release-base-images
release-docker-img
remove_cot
replace-fsdp-args
replace-gpu-testing
revert-1255-bump_version_v0.10.0.dev0
revert-1517-replace-fsdp-args
revert-1571-autoresume
revert-1636-dependabot/pip/databricks-connect-15.4.3
ricky-fsdp2-temp-version
ricky-yamls
rl-testing-ricky
rm_compile_glu
rm_torch
saaketh/NoOpTim
saaketh/cat-quantize
saaketh/composer_bump_0240
saaketh/composer_022_upgrade
saaketh/composer-bump-0280
saaketh/dataset_rev
saaketh/date_string
saaketh/docker_img_torch_bump
saaketh/fc_config
saaketh/float8_exp_linears
saaketh/fused_glu
saaketh/generation_benchmarking
saaketh/generation_benchmarking_2
saaketh/hf_checkpoint_hang
saaketh/hf_ckpt_logs
saaketh/hf_ckpt_mem
saaketh/icl_req_false
saaketh/logs-inv
saaketh/lora_init_test
saaketh/meta_rope
saaketh/modified_initialization
saaketh/moe_defaults
saaketh/name_or_path
saaketh/openai-bump
saaketh/peft_trainable
saaketh/pep585
saaketh/qlora_eval
saaketh/quant_save
saaketh/readme_installs_fix
saaketh/remove_olmo
saaketh/remove_te
saaketh/replication_test
saaketh/revert_dataloader
saaketh/streaming_v081
saaketh/streaming076
saaketh/streaming-0100
saaketh/update_yamls
schema-perms-user-error
science/peft
sequential_code_gen_samples
sharegpt-format
sharegpt-format-eitan
shitaoli-db/MCLOUD-4623
shitaoli-db/fix-dataloader-error
staging-debug
structured-logs
tessa/callib
tessa/callibration-script
tessa/copyrighteval
tessa/output_eval_logging
tessa-safety-eval
test-gpu
test-sharding
testing-semdedup1
tool-use
torch-2-3-bump-1
torch-2.7-upgrade-ricky
torch-mem
torch-upgrade
train-cli
truthfulqa
update_gpu_tests
update-version
updt_new_group
use_remote_uploader_v2
validate-cluster-delta
vanshcsingh/add-logging
verbose-foundry
vincent-remove-composer-init
will/test_deps
xiaohan/delta_converter_upgrade
xiaohan/delta-streaming-test
xiaohan/enable_extra_arg_test
xiaohan/env_no_databricks
xiaohan/fix_setuppy
zero-shot
1024
bowenyang008
committed
275 days ago
095c9a76
Merge remote-tracking branch 'origin/main' into boweny/fsdp2/playground
bowenyang008
committed
276 days ago
0cb57404
Update ci-testing version to latest (#1827)
dakinggg
committed
276 days ago
dc10d0a5
Delete useless print("here") (#1826)
Omar Zoloev
committed
276 days ago
e74b7c35
Bump docformatter for python3.12 and change blank_line_before_module_docstring = false (#1825)
sashaDoubov
committed
276 days ago
eef14258
Update ci-testing version to latest (#1827)
dakinggg
committed
277 days ago
Verified
358b23aa
Delete useless print("here") (#1826)
Omar Zoloev
committed
277 days ago
Verified
6b6ec053
Bump docformatter for python3.12 and change blank_line_before_module_docstring = false (#1825)
sashaDoubov
committed
278 days ago
Verified
f0018598
revert
bowenyang008
committed
278 days ago
726f93a4
Merge remote-tracking branch 'origin/main' into boweny/fsdp2/playground
bowenyang008
committed
278 days ago
dfac27ae
Bump onnx from 1.17.0 to 1.18.0 (#1823)
dependabot[bot]
committed
279 days ago
Verified
8aad84d5
Update accelerate requirement from <1.7,>=0.25 to >=0.25,<1.8 (#1824)
dependabot[bot]
committed
279 days ago
Verified
f83434f6
Fix Dtensor initialization (#1820)
bowenyang008
committed
282 days ago
Verified
d39debbe
Revert "hack again"
bowenyang008
committed
282 days ago
df2a7e1d
hack again
bowenyang008
committed
282 days ago
5967d0cc
Revert "benchmark flag"
bowenyang008
committed
282 days ago
2ddfda5c
Revert "return hack"
bowenyang008
committed
282 days ago
4912fe3c
return hack
bowenyang008
committed
282 days ago
d84d9ca9
benchmark flag
bowenyang008
committed
282 days ago
65b12517
Revert "Revert "log time""
bowenyang008
committed
282 days ago
f053b28e
Revert "log time"
bowenyang008
committed
282 days ago
0b8004cc
log time
bowenyang008
committed
282 days ago
2b76179f
Deprecate inference API wrappers (#1821)
dakinggg
committed
282 days ago
Verified
8c865f4d
Merge remote-tracking branch 'origin/boweny/param-init-dtensor' into boweny/fsdp2/playground
bowenyang008
committed
282 days ago
134adb64
pyright
bowenyang008
committed
282 days ago
e2730b9c
fix return type
bowenyang008
committed
282 days ago
58832b87
format
bowenyang008
committed
282 days ago
becda286
format
bowenyang008
committed
282 days ago
8e1b045f
doc
bowenyang008
committed
282 days ago
f46c0f17
doc
bowenyang008
committed
282 days ago
173050d1
Older