Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
mosaicml/llm-foundry
Pull Requests
Commits
mpt-7b-test
3.10-to-string
CPT_Offset
abhay/add_warmrestarts
abhay/better_logging
abhay/mm_dev
add_bbh
add_math
add_trtllm_dev
add_trtllm_wrapper
add-mflow-logger
aditi-generate-evals
angel/catch-file-not-found
angel/log-data-for-run-analytics
anna/dbrxeval
anna/eval-loader
anna/patch
anna/sp-import
asfandyar/fmapi
batch_code_eval_withtasks
bcui/tokenizer_breaking_checkpointing
bigning-patch-1
bilal-activation-monitor
boweny/composer-0.31
boweny/fsdp2/playground
boweny/param-init-dtensor
boweny/playground/onboarding
boweny/torch-2.7
boweny/use-foundry-image
brier_score
bruce/fix-wikiqa
bruce/vllm-eval
bruce/vllm-eval-v2
bruce/8k
bump_composer_version_0.26.0
bump_dev_version
bump_foundry_0.15.0_yamls
bump_mcli_examples_v0.13.0
bump_mcli_examples_v0.14.0
bump_mcli_examples_v0.16.0
bump_version_v0.7.0
bump_version_v0.9.0.dev0
bump_version_v0.18.0.dev0
bump_version_0.12.0.dev0
bump_version_0.14.0.dev0
bump_version_0.15.0.dev0
bump_version_0.16.0.dev0
bump_version_0.17.0.dev0
bump_version_0.17.0
bump-0.20.0.dev0
bump-composer-0.30.0
bump-example-yamls-0.19.0
bump-fa2-2.7.4.post1
bump-streaming-0.12.0
bump-te-1.3
byod/data_validation_fix
byod/data_validation
cache-docker-release-builds
catch-cluster-perm
catch-delta-table-not-found
catch-grcp-hardware
chronos-test-vincen
chuck/add_foundry_te_again
chuck/add_foundry_te_docker_no_deps
chuck/add_foundry_te_docker
chuck/add_foundry_te_torch_2_1
chuck/add_hf_ckpt_fix
chuck/add_llama3_yaml
chuck/add_te_together
chuck/add_te
chuck/add_torch_2_4_nightly_image
chuck/bump_mosaicml_version_again
chuck/bump_mosaicml_version
chuck/bump_torch_version
chuck/bump_torch_2_4
chuck/bump-torch-2-5
chuck/debug_keys
chuck/fix_eval_with_drop_last_flag
chuck/fix_hf_task_format
chuck/fix_llama3_yaml
chuck/fix_llm_foundry
chuck/fix_te_docker_shard
chuck/fix_te_eval_with_drop_last_flag
chuck/gpu-build-te
chuck/gpu-build-te-win
chuck/log_mpt_config
chuck/replace_hf_causal_lm
chuck/revert_te
chuck/rl_bpt
chuck/rl
chuck/save_te_onnx_export_main
chuck/speedup_add_foundry_te_docker_no_dep
chuck/te-install
chuck/test_callback_load
chuck/test_ckpt_fix
chuck/test_one_more_te
chuck/test_te_shard_weight
chuck/torch_2_5_bump
chuck/update_bpt
chuck/update_setup_dockerfile
clear-entrypoint-cli
cli99/eval
cli99/vllm-eval-v2-lctx
codestar12-patch-1
comment-ghcr
composer-bump
convert_examples_ckpt-cli
dataforge/enable_all_cpu
davis/update-lion8b
debug_f1_score
debug_gauntlet_v0.3
debug_hang
debug_resumeoom
debug_triton
dependabot/pip/datasets-gte-3.3.2-and-lt-4.4
dependabot/pip/flash-attn-2.8.3
dependabot/pip/huggingface-hub-hf_xet--gte-0.30.0-and-lt-0.37
dependabot/pip/onnxruntime-1.23.2
dependabot/pip/transformers-gte-v4.51.0-and-lt-4.58
deprecate-fsdp-config
eitan-patch-json
embedding-infer-step-size
enforce-compute-cluster-version
error
ethantang-db/composer_main
ethantang-db/composer_32_1_fix
ethantang-db/dle_package_upgrades
ethantang-db/sdpa
ethantang-db/slowdown_debug
ethantang-db/tokenizers_optional
ethantang-db/upgrade_transformers
ethantang-db/v0.23.0_dev
f1_score
fastrms
finetuning
fix-cl
fix-fp8-act-ckpt-flag
gate-megablocks
generation_kwargs_fix
habana_alpha
hanlin/dbrx_updates
hf_to_ft_convert_fix
hfcheckpointer-optional-generation-config
human_eval_pack
initialization_postlayernorms_residuals
irene/world-size-test
james/unsafe-types
jane/add-exceptions
jane/add-ft-error-handling
jane/download-hf-to-uc
jane/fail-run
jane/fix-error
jane/mlflow-upload
jane/re-on-timeout
jane/remove-rich
jane/test-exceptions
jerry/experimental
jerry/mlflow-objectstore-exp
jerry/model-cache
jerry/oras
jfrankle-061523
jfrankle-fix
jimmy/data-split
josejg/harmony-fix
josejg/harmony-fix-debug
josejg/qknorm
josejg/tunes
josejg-envlogger-name
josejg-register-nanmonitor
kmmlu
linden/tp
llama_cot_fix
lupesko-patch-1
main
manual-cl
matt/registration-error
matt/split-mds-script
matt/split-mds-script-new
mcli-version-bump
milo/catch-bad-split-regex
milo/catch-more-grpc
milo/catch-more-unknown-example-types
milo/catch-more-unknown-example-types-1
milo/data-prep
milo/fix-retry-crash-loop
milo/fsdp-2-playground
milo/harbor-checkpointer
milo/uncomment-gpu-tests
milo/update-readme-for-variables
milo/update-version-names
milo/wrap-spark-errors-ii
model_gauntlet_v0.1
mpt-7b-test
mpt-quantization-eval
mvpatel2000/fuse-chunk
mvpatel2000/mla
mvpatel2000/relu-attn
mvpatel2000/relu-squared-attn
mvpatel2000/sync
nancy/combined
nancy/register-model
nancyhung/update-error-message
nicholas/exp-debug-issues
nicholas/finetuning-exp
nicholas/uc-upload
nik/ds-llama
nik/ft-model-handler
open-source-embeddings
openai_compatible_gauntlet
output_eval_logging
perms-select-table
pipeline-default-none
quantization-benchmarking
rag_generation_tasks
rag_plus_f1
refactor_Qa
registration
release/v0.2.0
release/v0.3.0
release/v0.4.0
release/v0.5.0
release/v0.6.0
release/v0.7.0
release/v0.8.0
release/v0.9.0
release/v0.9.1
release/v0.10.0
release/v0.11.0
release/v0.12.0
release/v0.13.0
release/v0.13.1
release/v0.14.0
release/v0.14.1
release/v0.14.2
release/v0.14.3
release/v0.14.4
release/v0.14.5
release/v0.15.0
release/v0.15.1
release/v0.16.0
release/v0.17.0
release/v0.17.1
release/v0.18.0
release/v0.19.0
release/v0.20.0
release/v0.21.0
release/v0.22.0
release-base-images
release-docker-img
remove_cot
replace-fsdp-args
replace-gpu-testing
revert-1255-bump_version_v0.10.0.dev0
revert-1517-replace-fsdp-args
revert-1571-autoresume
revert-1636-dependabot/pip/databricks-connect-15.4.3
ricky-fsdp2-temp-version
ricky-yamls
rl-testing-ricky
rm_compile_glu
rm_torch
saaketh/NoOpTim
saaketh/cat-quantize
saaketh/composer_bump_0240
saaketh/composer_022_upgrade
saaketh/composer-bump-0280
saaketh/dataset_rev
saaketh/date_string
saaketh/docker_img_torch_bump
saaketh/fc_config
saaketh/float8_exp_linears
saaketh/fused_glu
saaketh/generation_benchmarking
saaketh/generation_benchmarking_2
saaketh/hf_checkpoint_hang
saaketh/hf_ckpt_logs
saaketh/hf_ckpt_mem
saaketh/icl_req_false
saaketh/logs-inv
saaketh/lora_init_test
saaketh/meta_rope
saaketh/modified_initialization
saaketh/moe_defaults
saaketh/name_or_path
saaketh/openai-bump
saaketh/peft_trainable
saaketh/pep585
saaketh/qlora_eval
saaketh/quant_save
saaketh/readme_installs_fix
saaketh/remove_olmo
saaketh/remove_te
saaketh/replication_test
saaketh/revert_dataloader
saaketh/streaming_v081
saaketh/streaming076
saaketh/streaming-0100
saaketh/update_yamls
schema-perms-user-error
science/peft
sequential_code_gen_samples
sharegpt-format
sharegpt-format-eitan
shitaoli-db/MCLOUD-4623
shitaoli-db/fix-dataloader-error
staging-debug
structured-logs
tessa/callib
tessa/callibration-script
tessa/copyrighteval
tessa/output_eval_logging
tessa-safety-eval
test-gpu
test-sharding
testing-semdedup1
tool-use
torch-2-3-bump-1
torch-2.7-upgrade-ricky
torch-mem
torch-upgrade
train-cli
truthfulqa
update_gpu_tests
update-version
updt_new_group
use_remote_uploader_v2
validate-cluster-delta
vanshcsingh/add-logging
verbose-foundry
vincent-remove-composer-init
will/test_deps
xiaohan/delta_converter_upgrade
xiaohan/delta-streaming-test
xiaohan/enable_extra_arg_test
xiaohan/env_no_databricks
xiaohan/fix_setuppy
zero-shot
device mesh
Ning Wang
committed
1 year ago
15fb4ffa
Merge branch 'mpt-7b-test' of github.com:mosaicml/llm-foundry into mpt-7b-test
Ning Wang
committed
1 year ago
a15d9cb2
make eval loader to None
Ning Wang
committed
1 year ago
e68b7401
make eval load to None
Ning Wang
committed
1 year ago
fbe64944
fix train
Ning Wang
committed
1 year ago
e3bb4fbc
mpt 7b test
Ning Wang
committed
1 year ago
77187d36
add memorysnapshot to callbacks (#810)
cli99
committed
1 year ago
Verified
491fa7ab
Fix eval.py with lora (#965)
dakinggg
committed
1 year ago
Verified
12d1ca73
Fix typo (#966)
irenedea
committed
1 year ago
Verified
c7c9d247
Add streams support (#946)
bigning
committed
1 year ago
Verified
aa0ea6e2
Use create_model_version instead of register_model (#953)
dakinggg
committed
1 year ago
Verified
2f64a144
Add fully configurable activation checkpointing (#951)
cli99
committed
1 year ago
Verified
2e59620a
add finutuning with streaming dataset example (#945)
bigning
committed
1 year ago
Verified
60cdd0be
Bump mcli yaml foundry version to v0.5.0 (#959)
irenedea
committed
1 year ago
Verified
9f101847
Update TUTORIAL.md (#957)
sdonoso
committed
1 year ago
Verified
fe17c224
allow te to use meta device with deferred init (#958)
cli99
committed
1 year ago
Verified
8c7d6f43
Add default signature to mlflow saved model (#952)
dakinggg
committed
1 year ago
Verified
60ab97fb
Add finetuning streaming dataset conversion (#933)
bigning
committed
1 year ago
Verified
105f7663
Fix chain-of-thought tasks (#824)
bmosaicml
committed
1 year ago
Verified
6591f480
Bump llm-foundry version to 0.5.0 (#948)
irenedea
committed
1 year ago
Verified
a667ebf5
Add and use VersionedDeprecationWarning (#944)
irenedea
committed
1 year ago
Verified
c1c4bbfb
Retrieve license information when local files are provided for a pretrained model (#943)
jerrychen109
committed
1 year ago
Verified
2e0a8458
fix (#942)
mvpatel2000
committed
1 year ago
Verified
3f21bb7e
Update lora docs (#941)
dakinggg
committed
1 year ago
Verified
25599294
Fixing the gen_attention_mask_in_length function to handle the case when sequence id is -1 due to attention masking (#940)
ShashankMosaicML
committed
1 year ago
Verified
ad126a62
Remove extra call to .to and load_state_dict in hf checkpointer (#939)
dakinggg
committed
1 year ago
Verified
b9d2bfaf
Refactoring the function to accept list of metric names instead of a dictionary of metrics. (#938)
ShashankMosaicML
committed
1 year ago
Verified
706ea7dd
Switch to the Composer integration of LoRA (works with FSDP) (#886)
dakinggg
committed
1 year ago
Verified
15ee0acc
Update eval_gauntlet_callback.py with math.log2 (#821)
Skylion007
committed
1 year ago
Verified
d9874d2a
bump (#934)
dakinggg
committed
1 year ago
Verified
86b5a981
Older