transformers
Cache: new Cache format in decoder-only models
#31421
Merged

Cache: new Cache format in decoder-only models #31421

zucchini-nlp
zucchini-nlp draft bart with new cache
183cd66a
zucchini-nlp add cache for decoder-only models
4578bca4
zucchini-nlp revert utils
9505ca4c
zucchini-nlp modify docstring
2ab28f34
zucchini-nlp revert bart
5fe4e9e3
zucchini-nlp minor fixes
09413c30
zucchini-nlp fix copies (not related)
3c276043
zucchini-nlp zucchini-nlp requested a review from gante gante 1 year ago
zucchini-nlp
zucchini-nlp commented on 2024-06-14
zucchini-nlp
zucchini-nlp commented on 2024-06-14
zucchini-nlp
zucchini-nlp commented on 2024-06-14
zucchini-nlp revert tests
350acc5a
HuggingFaceDocBuilderDev
gante
gante
gante commented on 2024-06-14
zucchini-nlp remove enc-dec related code
c0adf10d
zucchini-nlp remove bloom
c18b1775
zucchini-nlp remove opt (enc-dec)
582f289d
zucchini-nlp Merge remote-tracking branch 'upstream/main' into dynamic_cache_decod…
3141a715
gante
gante approved these changes on 2024-06-17
gante gante requested a review from ArthurZucker ArthurZucker 1 year ago
zucchini-nlp update docstring
33d54b49
zucchini-nlp git, codegen, gpt_neo, gpt_neox, gpj
dd05e6bf
zucchini-nlp
ArthurZucker
ArthurZucker commented on 2024-06-18
zucchini-nlp clean up
cb878d57
zucchini-nlp copied from statements
0588791e
zucchini-nlp revert
a27b47c6
zucchini-nlp tmp
1abcf305
ArthurZucker
zucchini-nlp update warning msg
00ed88c4
zucchini-nlp forgot git
6c3b3aa7
zucchini-nlp
gante
gante commented on 2024-06-20
zucchini-nlp add more flags
fd5eeabb
zucchini-nlp run-slow git,codegen,gpt_neo,gpt_neox,gpj
e233f296
zucchini-nlp
zucchini-nlp add cache flag to VLMs
356d578d
zucchini-nlp remove files
c9066701
zucchini-nlp
zucchini-nlp Merge branch 'main' into dynamic_cache_decoder_only
08d9e6f0
zucchini-nlp style
56c05b2b
zucchini-nlp video LLMs also need a flag
85108109
zucchini-nlp style
cebb55d4
zucchini-nlp zucchini-nlp requested a review from ArthurZucker ArthurZucker 1 year ago
zucchini-nlp llava will go in another PR
8fd9dd1f
zucchini-nlp Merge branch 'main' into dynamic_cache_decoder_only
4b9ced10
zucchini-nlp style
aea219ba
zucchini-nlp zucchini-nlp added run-slow
zucchini-nlp [run-slow] codegen, falcon, git, gpt_neo, gpt_neox, gptj, idefics
49918633
ArthurZucker
ArthurZucker commented on 2024-07-12
zucchini-nlp Update src/transformers/models/gpt_neo/modeling_gpt_neo.py
ec306a26
zucchini-nlp copy from
cf793b7d
zucchini-nlp deprecate until v4.45 and warn if not training
c92409c7
zucchini-nlp nit
c2b97e41
zucchini-nlp fix test
35b60de9
zucchini-nlp test static cache
d2fca9a1
zucchini-nlp Merge branch 'main' into dynamic_cache_decoder_only
0933350e
zucchini-nlp add more tests and fix models
42349d49
zucchini-nlp fix copies
45c3a1bd
zucchini-nlp
zucchini-nlp return sliding window mask
5f226161
zucchini-nlp
zucchini-nlp zucchini-nlp requested a review from ArthurZucker ArthurZucker 1 year ago
gante
ArthurZucker
ArthurZucker
ArthurZucker approved these changes on 2024-08-05
zucchini-nlp
ArthurZucker
zucchini-nlp run slow tests & fix + codestyle
f5af6a28
zucchini-nlp
zucchini-nlp one more falcon fix for alibi
21b45c5d
gante
zucchini-nlp zucchini-nlp merged a30c865f into main 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone