transformers
Fix GPT-NeoX-20B past handling, attention computation
#17811
Merged

Fix GPT-NeoX-20B past handling, attention computation #17811

zphang
zphang zphang force pushed 3 years ago
sgugger
sgugger approved these changes on 2022-06-21
HuggingFaceDocBuilderDev
sgugger
patrickvonplaten
patrickvonplaten commented on 2022-06-21
patrickvonplaten
patrickvonplaten commented on 2022-06-21
patrickvonplaten
patrickvonplaten approved these changes on 2022-06-21
zphang zphang force pushed 3 years ago
zphang
sgugger
zphang Fix GPT-NeoX-20B past handling, swap attention computation to hopeful…
50f66010
zphang 20B tests
d2e9de90
zphang zphang force pushed to d2e9de90 3 years ago
sgugger sgugger merged 205bc415 into main 3 years ago
sgugger

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone