Falcon port #24523

Rocketknight1 merged 82 commits into main from falcon_port
Rocketknight1
Rocketknight1 Rocketknight1 requested a review from sgugger sgugger 2 years ago
Rocketknight1 Rocketknight1 requested a review from LysandreJik LysandreJik 2 years ago
sgugger
sgugger commented on 2023-06-27
Rocketknight1
Rocketknight1 Rocketknight1 force pushed to 290dd307 2 years ago
HuggingFaceDocBuilderDev
ArthurZucker
ArthurZucker
ArthurZucker commented on 2023-06-30
WilliamTambellini
Rocketknight1 Rocketknight1 force pushed to a17e7831 2 years ago
Rocketknight1 Rocketknight1 requested a review from amyeroberts amyeroberts 2 years ago
Rocketknight1
amyeroberts
amyeroberts commented on 2023-07-05
miladm
miladm commented on 2023-07-05
amyeroberts
amyeroberts approved these changes on 2023-07-07
Rocketknight1 Initial commit
37f084a6
Rocketknight1 Update src/transformers/models/falcon/configuration_falcon.py
f7008b46
Rocketknight1 Update src/transformers/models/falcon/configuration_falcon.py
b3278e06
Rocketknight1 Cleanup config docstring
d0c0db5a
Rocketknight1 Update src/transformers/models/falcon/configuration_falcon.py
89a24884
Rocketknight1 Convert to relative imports
3e9f11ef
Rocketknight1 Remove torch < 1.8 warning
9633df5f
Rocketknight1 Restructure cos_sin header
a4b91cde
Rocketknight1 qkv -> query, key, value
f31d0ea6
Rocketknight1 Refactor attention calculation
97a7aa73
Rocketknight1 Add a couple of config variables to account for the different checkpo…
31e24c72
Rocketknight1 Successful merging of the code paths!
88fca168
Rocketknight1 Fix misplaced line in the non-parallel attention path
92da7fe2
Rocketknight1 Update config and tests
e5c67d0d
Rocketknight1 Add a pad_token_id when testing
d4d91bd7
Rocketknight1 Support output_attentions when alibi is None
8febeb9d
Rocketknight1 make fixup
40010512
Rocketknight1 Skip KV cache shape test
8900244c
Rocketknight1 No more _keys_to_ignore_on_load_missing
eb1de871
Rocketknight1 Simplify self attention a bit
3609c58e
Rocketknight1 Simplify self attention a bit
03819d4d
Rocketknight1 make fixup
625baa73
Rocketknight1 stash commit
2c42786d
Rocketknight1 Some more attention mask updates
57e8ad14
Rocketknight1 Should pass all tests except assisted generation!
5cbec392
Rocketknight1 Add big model generation test
92484ab7
Rocketknight1 make fixup
677781a6
Rocketknight1 Add temporary workaround for test
80d8d201
Rocketknight1 Test overrides for assisted generation
e53bf96e
Rocketknight1 Update src/transformers/models/falcon/modeling_falcon.py
aef8be76
Rocketknight1 Update src/transformers/models/falcon/modeling_falcon.py
025435e8
Rocketknight1 Update src/transformers/models/falcon/modeling_falcon.py
d06ce2fc
Rocketknight1 Update tests/models/falcon/test_modeling_falcon.py
3b396073
Rocketknight1 Test overrides for assisted generation
2f738005
Rocketknight1 Add generation demo
eb83a309
Rocketknight1 Update copyright
65207ccc
Rocketknight1 Make the docstring model actually small
05585536
Rocketknight1 Add module-level docstring
b88bd8b1
Rocketknight1 Remove all assertions
e68c77cb
Rocketknight1 Add copied from bloom
ee9a72f9
Rocketknight1 Reformat the QKV layer
21763d58
Rocketknight1 Add copied from bloom
a6ecc913
Rocketknight1 Update src/transformers/models/falcon/modeling_falcon.py
6620500e
Rocketknight1 Remove unused line and reformat
8123582b
Rocketknight1 No single letter variables
96f6c787
Rocketknight1 Cleanup return names
56f44430
Rocketknight1 Add copied from line
f1ff143f
Rocketknight1 Remove the deprecated arguments blocks
339d4653
Rocketknight1 Change the embeddings test to an alibi on/off test
67e530fd
Rocketknight1 Remove position_ids from FalconForQA
ae3cfc6a
Rocketknight1 Remove old check for token type IDs
7623f3a6
Rocketknight1 Fix the alibi path when multi_query is False
b2facf7c
Rocketknight1 Update src/transformers/models/falcon/modeling_falcon.py
1c6353bc
Rocketknight1 Update src/transformers/models/falcon/modeling_falcon.py
b9a2e267
Rocketknight1 Update tests/models/falcon/test_modeling_falcon.py
5863002b
Rocketknight1 Update config naming
456b5239
Rocketknight1 Fix typo for new_decoder_architecture
7a035aed
Rocketknight1 Add some comments
0e661eb0
Rocketknight1 Fix docstring
f5709ad3
Rocketknight1 Fix docstring
a9e0d0e0
Rocketknight1 Create range in the right dtype from the start
3e0b9784
Rocketknight1 Review comment cleanup
58dda7e4
Rocketknight1 n_head_kv -> num_kv_heads
17c4b9da
Rocketknight1 self.alibi -> self.use_alibi
57fe9f24
Rocketknight1 self.num_kv -> self.num_kv_heads
302cc955
Rocketknight1 Reorder config args
b744ccdc
Rocketknight1 Made alibi arguments Optional
e2f2db1e
Rocketknight1 Add all model docstrings
3b948311
Rocketknight1 Add extra checkpoints
ab662380
Rocketknight1 Add author info for Falcon
94c33e9c
Rocketknight1 Stop removing token_type_ids because our checkpoints shouldn't return…
131be5f8
Rocketknight1 Add one hopeful comment for the future
e11ff0cc
Rocketknight1 Fix typo
0de24ed2
Rocketknight1 Update tests, fix cache issue for generation
2bdc92df
Rocketknight1 Use -1e9 instead of -inf to avoid float overflow
e7117195
Rocketknight1 Recompute the rotary embeddings much less often
d7176e27
Rocketknight1 Re-enable disabled tests
21f84bd6
Rocketknight1 One final fix to attention mask calculation, and update tests
33593888
Rocketknight1 Cleanup targeting falcon-40b equivalency
fb4e5557
Rocketknight1 Rocketknight1 force pushed to fb4e5557 2 years ago
Rocketknight1 Post-rebase docs update
28d55b9f
Rocketknight1 Update docstrings, especially in the config
c1a3559e
Rocketknight1 More descriptive variable names, and comments where we can't rename them
5125fb9a
Rocketknight1 Rocketknight1 merged b3ab3fac into main 2 years ago
Rocketknight1 Rocketknight1 deleted the falcon_port branch 2 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone