First draft of RWKV-4
3e0855b7
Add support for generate
db50d8b8
Style post-rebase
b6273e9e
sgugger force pushed from c483bdf5 to b6273e9e 2 years ago
Properly use state
7b819307
Write doc
400147ec
Fix doc
86035324
More math
bc0cd7cb
Add model to README, dummies and clean config
77365843
Fix init
a6aa9327
multiple fixes:
a50d49cc
Merge branch 'main' into add_rwkv
1ca42ad9
correct tokenizer
848ccf81
Merge branch 'add_rwkv' of https://github.com/huggingface/transformer…
a33c5aa8
some tweaks
b679baa8
Merge remote-tracking branch 'upstream/main' into add_rwkv
6ce80d2a
fix CI tests
10b5b81d
fix conversion script
ffb55f99
add slow tests + more fixes on conversion script
7fd5702c
add another test
e52be94d
final fixes
3c02506a
change single name variable
69774da9
add mock attention mask for pipeline to work
d52d6dbc
correct eos token id
0d758e13
fix nits
df44a606
add checkpoints
9e3efc5a
Apply suggestions from code review
0b095c79
add `tie_word_embeddings` in docstring
f912a0d5
change tensor name
2a6dd617
fix final nits
4fdbeefa
Merge remote-tracking branch 'upstream/main' into add_rwkv
dc221691
Trigger CI
d1c2b15e
sgugger merged b4d4d6fe into main 2 years ago
sgugger deleted the add_rwkv branch 2 years ago