Add xlstm model #39665

Cyrilvallez merged 163 commits into main from xlstm-model
Cyrilvallez
kpoeppel Add xLSTM cleanly with optimizations.
e03391c0
kpoeppel Fix style.
cf14cea1
kpoeppel Fix modeling test.
4254fdcb
kpoeppel Make xLSTM package optional.
87dd1dbe
kpoeppel Fix: Update torch version check.
7a8d10e7
kpoeppel Fix: Bad variable naming in test.
4456672a
kpoeppel Fix: Import structure cleaning with Ruff.
41173b83
kpoeppel Fix: Update docstrings.
e4afcd1f
kpoeppel Fix: Mitigate unused config attr tests by explicit usage.
cf5bcdcb
kpoeppel Fix: Skip tests, if xlstm library is not installed.
1d6642f2
kpoeppel Feat: Enable longer context window for inference by chunking.
cfabeca3
kpoeppel Fix: Make training test pass by lowering target accuracy.
fa5859c6
kpoeppel Chore: Increase test verbosity for failing generation test.
6896afb4
kpoeppel Update docs/source/en/model_doc/xlstm.md
34c54c3c
kpoeppel Fix: Make xlstm available even without CUDA.
495de792
kpoeppel Chore: Remove unnecessary import.
dd69d761
kpoeppel Fix: Remove BOS insertion.
87010c69
kpoeppel Chore: Improve xLSTMCache documentation.
1e27a6f1
kpoeppel Integrate basic xLSTM fallback code.
c2232115
kpoeppel Chore: Remove unnecessary import.
207bb98e
kpoeppel Chore: Remove duplicate LayerNorm.
79939877
superbock chore: update copyright, minor reformatting
34182bab
superbock fix: refactor mLSTMStateType due to missing torch import
8087039d
superbock fix: add missing import
3482173c
kpoeppel Chore: Replace einops.
71420768
superbock fix: apply ruff formatting
10732c5d
superbock fix: run `make fix-copies` to re-generate dummy_pt_objects.py
890c5913
superbock fix: make type hints Python 3.9 compatible
6604d59f
superbock fix: remove obsolete import
ffcf0d60
superbock fix: remove obsolete method from docs
e53c0896
superbock chore: remove obsolete `force_bos_token_insert` from config
3f6aec99
kpoeppel Merge branch 'main' into integrate_xlstm_clean
ee2d0487
kpoeppel Chore: Remove duplicated xLSTMCache class.
07e1271c
kpoeppel Merge branch 'main' into integrate_xlstm_clean
e2c44811
kpoeppel Fix: Formatting of modeling_xlstm.py
22be117c
kpoeppel Chore: Remove xlstm package requirement from test. Re-add update_rnn_…
59b421fd
kpoeppel Merge branch 'main' into integrate_xlstm_clean
34af1097
kpoeppel Fix: Update xLSTMCache docstring.
faffe5f8
kpoeppel Feat: Add proper initialization of xLSTM.
3cabdc59
kpoeppel Chore: Re-format files.
2d758e00
kpoeppel Chore: Adapt format.
26f8f02e
kpoeppel Merge branch 'main' into integrate_xlstm_clean
ff55a7cd
kpoeppel Merge branch 'main' into integrate_xlstm_clean
c55c0fba
kpoeppel Merge branch 'main' into integrate_xlstm_clean
e52386b5
kpoeppel Merge branch 'main' into integrate_xlstm_clean
addefe0c
kpoeppel Fix: xLSTMCache import restructuring.
3b567665
kpoeppel Fix: Add __all__ lists to modeling and configuration files.
05152fbb
kpoeppel Chore: Reformat.
75231aa9
kpoeppel Merge branch 'main' into integrate_xlstm_clean
d01db9f1
kpoeppel Merge branch 'main' into integrate_xlstm_clean
1f2ea7db
kpoeppel Merge branch 'main' into integrate_xlstm_clean
98f4f029
kpoeppel Merge branch 'main' into integrate_xlstm_clean
5e5ed329
kpoeppel Merge branch 'main' into integrate_xlstm_clean
ffe2e911
kpoeppel Merge branch 'main' into integrate_xlstm_clean
d43976cd
kpoeppel Merge branch 'main' into integrate_xlstm_clean
e4becc32
kpoeppel Merge branch 'main' into integrate_xlstm_clean
29f8a3c4
kpoeppel Fix: Remove unnecessary update_rnn_state function.
de1d2a08
kpoeppel Fix: Undo test accuracy quickfix.
ad6f87df
kpoeppel Fix: Update copyright year, remvoe config copy.
eb2a6fd8
kpoeppel Chore: Flatten all internal configs to xLSTMConfig.
61c6555b
kpoeppel Merge branch 'main' into integrate_xlstm_clean
adfc4367
kpoeppel Fix: Unused config variables check.
2c480f93
kpoeppel Chore: Remove unnecessary imports.
11654c9d
kpoeppel Merge branch 'main' into integrate_xlstm_clean
4becf2ea
kpoeppel Fix: Unify xlstm cache argument from batch_size to max_batch_size.
01801dc0
kpoeppel Merge branch 'main' into integrate_xlstm_clean
8f1c0b81
kpoeppel Chore: Remove bad default arg value for xLSTMCache.
e04ee6e5
kpoeppel Merge branch 'main' into integrate_xlstm_clean
bfa4c051
kpoeppel Chore: Rename core configuration arguments to HF default in xLSTM.
d4805862
kpoeppel Merge branch 'main' into integrate_xlstm_clean
65d17415
kpoeppel Chore: Fix formatting.
ed7e4103
kpoeppel Merge branch 'main' into integrate_xlstm_clean
3cae2baf
kpoeppel Fix: xLSTM Cache config access.
7a0a34e4
kpoeppel Merge branch 'main' into integrate_xlstm_clean
8a046c5e
kpoeppel Fix: Update xlstm tests for config update.
c57740e3
kpoeppel Feat: Re-add embbeding_dim, num_blocks config options for compat with…
1628294b
kpoeppel Fix: Configuration xLSTM python3.9 syntax.
666e6fce
kpoeppel Fix: Difference to main in test_utils.py assertion.
4d80f00d
kpoeppel Fix: Bad syntax in xlstm config for python3.9.
7519b826
kpoeppel Fix: xLSTMConfig docstring.
2b2bb03d
kpoeppel Fix: xLSTMConfig docstring.
a2df4dd6
kpoeppel Merge branch 'main' into integrate_xlstm_clean
1d6e3296
kpoeppel Fix typing issues in xLSTM and BeiT, Paligemma.
de022f56
kpoeppel Fix: Exclude xLSTM from test cache utils.
f979b822
kpoeppel Merge branch 'main' into integrate_xlstm_clean
80eb87e3
kpoeppel Merge branch 'main' into integrate_xlstm_clean
c6920160
kpoeppel Merge branch 'main' into integrate_xlstm_clean
e5cdc442
kpoeppel Merge branch 'main' into integrate_xlstm_clean
5b586426
kpoeppel Merge branch 'main' into integrate_xlstm_clean
cea2602a
kpoeppel Merge branch 'main' into integrate_xlstm_clean
4d43edd5
kpoeppel Merge branch 'main' into integrate_xlstm_clean
0922dd57
kpoeppel Merge branch 'main' into integrate_xlstm_clean
206b29fd
kpoeppel Merge branch 'main' into integrate_xlstm_clean
3c6a2abc
kpoeppel Merge branch 'main' into integrate_xlstm_clean
2ada509c
kpoeppel Merge branch 'main' into integrate_xlstm_clean
9e6137e2
kpoeppel Merge branch 'main' into integrate_xlstm_clean
9b61860d
kpoeppel Merge branch 'main' into integrate_xlstm_clean
448e6f42
kpoeppel Merge branch 'main' into integrate_xlstm_clean
9da66534
kpoeppel Merge branch 'main' into integrate_xlstm_clean
6f6bcc66
kpoeppel Merge branch 'main' into integrate_xlstm_clean
d90b73c7
kpoeppel Merge branch 'main' into integrate_xlstm_clean
6d3aa9e4
kpoeppel Merge branch 'main' into integrate_xlstm_clean
a725943e
kpoeppel Merge branch 'main' into integrate_xlstm_clean
cce8b597
kpoeppel Merge branch 'main' into integrate_xlstm_clean
3ff2a273
kpoeppel Merge branch 'main' into integrate_xlstm_clean
6754ca17
kpoeppel Merge branch 'main' into integrate_xlstm_clean
c152d23b
kpoeppel Merge branch 'main' into integrate_xlstm_clean
02c88712
kpoeppel Merge branch 'main' into integrate_xlstm_clean
225ec891
kpoeppel Merge branch 'main' into integrate_xlstm_clean
dae0d81f
kpoeppel Chore: Fix style.
feefeab2
kpoeppel Chore: Fix format.
7a66ecfb
kpoeppel Merge branch 'main' into integrate_xlstm_clean
7067c310
kpoeppel Merge branch 'main' into integrate_xlstm_clean
6b2a79bc
kpoeppel Merge branch 'main' into integrate_xlstm_clean
32dcea8c
kpoeppel Merge branch 'main' into integrate_xlstm_clean
75cb489e
kpoeppel Merge branch 'main' into integrate_xlstm_clean
0068ab6b
kpoeppel Merge branch 'main' into integrate_xlstm_clean
b3f40e38
kpoeppel Merge branch 'main' into integrate_xlstm_clean
a3c7fd7d
kpoeppel Merge branch 'main' into integrate_xlstm_clean
3de5f24a
kpoeppel Merge branch 'main' into integrate_xlstm_clean
7a2983f2
kpoeppel Merge branch 'main' into integrate_xlstm_clean
1bde78a3
kpoeppel Chore: Remove unnecessary LayerNorm, NormLayer layer abstractions.
7c239aaa
kpoeppel Chore: Remove asserts and replace with ValueErrors.
f8fcc6ed
kpoeppel Chore: Update __init__.py structure of xLSTM.
44550fa4
kpoeppel Chore: Clean xLSTM initialization of weights.
c7ce6a50
kpoeppel Fix index names in modeling_xlstm.py
5f8a399c
kpoeppel Update xlstm model test typing annotations.
b15d6b8d
kpoeppel Merge branch 'main' into integrate_xlstm_clean
bd288058
kpoeppel Merge branch 'main' into integrate_xlstm_clean
bd9ae673
kpoeppel Merge branch 'main' into integrate_xlstm_clean
3a5bab8a
kpoeppel Fix: Remove all asserts.
a010b18b
kpoeppel Revert changes to the main __init__.py
ab301098
kpoeppel Fix: Move xLSTMCache to modeling_xlstm.py
718e4432
kpoeppel Fix: Remove xLSTMForCausalLM mapping from modeling_auto.py
55ef5915
kpoeppel Remove xLSTMCache from dummy_pt_objects.py
733a42a6
kpoeppel Fix: Remove extended torchdynamo compilation check integrating cuda g…
e955830d
kpoeppel Revert test_cache_utils.py xLSTM change.
0fe56334
kpoeppel Fix: Move xLSTM init functions before init call.
c1a69b38
kpoeppel Remove xLSTMCache from generation utils.
78eb2576
kpoeppel Fix: Clean xLSTM init functionality for recursive calls.
66e98cb2
kpoeppel Fix: Move xLSTMCache before its first call.
8dc3179a
kpoeppel Fix formatting.
70f797c5
kpoeppel Add partial docstring for xLSTMModel forward.
468b97f2
kpoeppel Fix xLSTMCache docstring in xLSTMModel.
89f623f8
kpoeppel Remove xLSTMCache from public documentation. Update auto_docstring.
55bc9724
kpoeppel Merge branch 'main' into integrate_xlstm_clean
2480d312
kpoeppel Merge branch 'main' into integrate_xlstm_clean
5b13a31f
kpoeppel Merge branch 'main' into integrate_xlstm_clean
99137e89
Cyrilvallez Merge branch 'main' into xlstm-model
55814f09
Cyrilvallez Remove all agressive shape comments
e4c94e5b
Cyrilvallez style
bf357449
HuggingFaceDocBuilderDev
Cyrilvallez Fix names
e4969e65
Cyrilvallez simplify
d867780e
Cyrilvallez remove output_hidden_states
3adc3d55
Cyrilvallez Update modeling_xlstm.py
f17ceae0
Cyrilvallez Update modeling_xlstm.py
35aa0c28
Cyrilvallez Update test_modeling_xlstm.py
4ee3afcc
Cyrilvallez Update modeling_xlstm.py
b2f0da55
Cyrilvallez Update modeling_xlstm.py
00bb1d12
Cyrilvallez fix
5aa1ee5f
github-actions
Cyrilvallez fix
2337f808
Cyrilvallez style
d496c729
Cyrilvallez style
a876235a
Cyrilvallez Cyrilvallez merged 6630c5b7 into main 151 days ago
Cyrilvallez Cyrilvallez deleted the xlstm-model branch 151 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone