transformers
Add OLMo November 2024
#34551
Merged


2015aroras marked this pull request as draft 1 year ago
2015aroras marked this pull request as ready for review 1 year ago
ArthurZucker approved these changes on 2024-11-14
HuggingFaceDocBuilderDev
2015aroras force pushed 1 year ago
2015aroras Add model skeleton with transformers-cli add-new-model-like
6e747c27
2015aroras Convert config to modular, add rms_norm_eps, delete clip_qkv
a80ffd18
2015aroras Convert model to modular, add RMSNorm
ffa794e4
2015aroras Add flash attention with qk norm and no qkv clipping
75d38f04
2015aroras Add decoder layer with RMSNorm after attention/feedforward layers
dbd880df
2015aroras Add base and causal model
06c9c44b
2015aroras Add converter improvements from OLMo repo
b73f6d39
2015aroras Update weight loading in OLMo to HF converter
c8d94115
2015aroras Set correct default for rms_norm_eps
4e3da14b
2015aroras Set correct pipeline_model_mapping in test
87d54bb3
2015aroras Run make fixup
b7939d2e
2015aroras Fix model type
d39587fc
2015aroras Re-run modular conversion
30c20f6d
2015aroras Manually set config docs to fix build errors
cdce1572
2015aroras Convert olmo-1124 to olmo_1124 to fix flash attention docs errors
3a9c61c5
2015aroras Start updating tests
949648e7
2015aroras Update tests
0217f409
2015aroras Copy upstream test_eager_matches_sdpa_inference_1_bfloat16 changes to…
1bdaa051
2015aroras Rename input_layernorm and post_attention_layernorm to reflect their …
0b1f2bf5
2015aroras Use correct tokenizer
9e7c77d4
2015aroras Remove test unsupported by GPT2 tokenizer
11f67eb7
2015aroras Create GenerationConfig outside of from_pretrained call
0c2a264e
2015aroras Use simpler init file structure
a22d9366
2015aroras Add explicit __all__ to support simplified init
a3cca575
2015aroras Make safetensor serialization the default
82a75c29
2015aroras force pushed to 82a75c29 1 year ago
2015aroras Update OLMo November 2024 docs
bfd2e635
ArthurZucker approved these changes on 2024-11-18
ArthurZucker merged 3ee24e22 into main 1 year ago

