transformers
Fix static generation when compiling!
#28937
Merged

ArthurZucker merged 42 commits into main from fix-static-kv-cache
ArthurZucker wow I was scared!
21876858
ArthurZucker changed the title from "wow I was scared!" to "Fix static generation when compiling!" 1 year ago
HuggingFaceDocBuilderDev
ArthurZucker fix everything
4922c924
ArthurZucker nits
56768a02
sanchit-gandhi
ArthurZucker make it BC?
b5650515
ArthurZucker add todo
99afd1ad
ArthurZucker commented on 2024-02-12
ArthurZucker nits
edc498fb
ArthurZucker is_tracing should still be used to pass tracing tests
651c4bd8
ArthurZucker nits
f69626e1
ArthurZucker marked this pull request as ready for review 1 year ago
ArthurZucker some nits to make sure generation works with static cache uncompiled
96136acb
ArthurZucker fix sdpa
d5ebd806
gante commented on 2024-02-12
ArthurZucker commented on 2024-02-13
gante approved these changes on 2024-02-13
ArthurZucker fix FA2 for both static and dynamic in a better way?
70adcf66
ArthurZucker style
61ed4cb4
ArthurZucker fix-copies
fedc5633
ArthurZucker fix fix copies
0195d58d
ArthurZucker fix sequential beam search
07f3adbb
ArthurZucker style
9402c25e
ArthurZucker use `keys_to_ignore`
86303c4d
ArthurZucker nit
fb9e9072
ArthurZucker correct dtype inference when init
9aa667e0
ArthurZucker :( the fix for FA2 is still not optimal to investigate!
68a5f294
ArthurZucker styling
3b9969b7
ArthurZucker Merge branch 'main' of github.com:huggingface/transformers into fix-s…
162ab877
ArthurZucker nits
914b0d7d
ArthurZucker nit
e79f79f6
ArthurZucker this might work better
ee2317d3
ArthurZucker add comment
93b2691a
ArthurZucker commented on 2024-02-14
ArthurZucker Update src/transformers/models/llama/modeling_llama.py
3619ed30
ArthurZucker "position_ids" -> "cache_position"
c23cdc42
ArthurZucker style
717a8e75
ArthurZucker Merge branch 'main' of github.com:huggingface/transformers into fix-s…
7fe09642
ArthurZucker Merge branch 'main' of github.com:huggingface/transformers into fix-s…
464c4637
ArthurZucker nit
80148abe
ArthurZucker commented on 2024-02-15
ArthurZucker commented on 2024-02-15
ArthurZucker Remove changes that should not be propagated just yet
c9f3c828
ArthurZucker Apply suggestions from code review
5f54d84e
ArthurZucker Styling
b3fc0428
ArthurZucker make sure we raise an error for static cache with FA2 enabled
5fdb2da8
younesbelkada commented on 2024-02-15
ArthurZucker move to the bottom of the signature
03edf912
ArthurZucker style
b762304e
ArthurZucker Update src/transformers/models/llama/modeling_llama.py
9fbe9014
ArthurZucker commented on 2024-02-15
ArthurZucker Update src/transformers/models/llama/modeling_llama.py
7afe7d93
ArthurZucker nit in the name
3772d1ca
ArthurZucker Merge branches 'fix-static-kv-cache' and 'fix-static-kv-cache' of git…
cf0bc324
younesbelkada approved these changes on 2024-02-15
ArthurZucker merged f3788b09 into main 1 year ago
ArthurZucker deleted the fix-static-kv-cache branch 1 year ago
alanwaketan
learning-chip
