mosaicml/llm-foundry
Commits
chuck/add_foundry_te_docker
b3d213a2 - commit change - Chuck Tang, committed 1 year ago
24f65fd8 - Bump datasets version (#1138) - dakinggg, committed 1 year ago, Verified
15abf8c2 - Fix saving of generation_config for Llama-3 (#1134) - eldarkurtic, committed 1 year ago, Verified
76f74b69 - First initialize dist with gloo (#1133) - dakinggg, committed 1 year ago, Verified
6252f791 - Fix InvalidPromptResponseKeysError bug (#1131) - b-chu, committed 1 year ago, Verified
72da1d70 - Mlflow datasets (#1119) - KuuCi, committed 1 year ago, Verified
0d62e611 - Update JSONL sources in eval README (#1110) - emmanuel-ferdman, committed 1 year ago, Verified
c53622e3 - Fix HF checkpointer + mlflow bugs (#1125) - dakinggg, committed 1 year ago, Verified
49521837 - Clean up the publicly exported API (#1128) - dakinggg, committed 1 year ago, Verified
0c6bd75f - Fix deprecation versions (#1129) - dakinggg, committed 1 year ago, Verified
6caa75a0 - Change main to a dev version (#1126) - dakinggg, committed 1 year ago, Verified
34264151 - Pin mlflow (#1124) - dakinggg, committed 1 year ago, Verified
f0646e88 - catch misconfigured hf dataset (#1123) - milocress, committed 1 year ago, Verified
698206d0 - Bump Composer to 0.21.3 (#1122) - b-chu, committed 1 year ago, Verified
63a7f125 - Add option for subclasses to convert model and tokenizer in hf checkpointer (#1121) - dakinggg, committed 1 year ago, Verified
20cb40c3 - add `.json` to SUPPORTED_EXTENSIONS (#1114) - eitanturok, committed 1 year ago, Verified
4bb4d4a2 - Bump transformers to 4.40 (#1118) - dakinggg, committed 1 year ago, Verified
84b64102 - Update tests to not rely on mistral (#1117) - dakinggg, committed 1 year ago, Verified
f01f6256 - Add missing init file (#1113) - dakinggg, committed 1 year ago, Verified
676ad7f9 - Param init registry (#1096) - dakinggg, committed 1 year ago, Verified
cb0de4f7 - FFN layer registry (#1095) - dakinggg, committed 1 year ago, Verified
3729ba3e - Migrate ICL classes to foundry (#936) - bmosaicml, committed 1 year ago, Verified
6257e5b9 - Update config_moe_args.py (#1112) - vchiley, committed 1 year ago, Verified
b58d68c1 - Revert "Update config_moe_args.py (#1104)" (#1111) - vchiley, committed 1 year ago, Verified
e9b1c6e6 - Dbrx finetune yaml requires save folder specified to enable autoresume (#1108) - mvpatel2000, committed 1 year ago, Verified
560012b6 - Attention layer registry (#1094) - dakinggg, committed 1 year ago, Verified
ed3daef0 - FC layer registry (#1093) - dakinggg, committed 1 year ago, Verified
4cd2324e - Support ShareGPT chat format (#1098) - samhavens, committed 1 year ago, Verified
b5fc0fad - GRT-2819 fix overwritting in script (#1107) - cli99, committed 1 year ago, Verified
7337429d - Add remote code option to allow execution of DBRX tokenizer (#1106) - b-chu, committed 1 year ago, Verified