[WIP] Add mammut #641

gpucce wants to merge 53 commits into mlfoundations:main from gpucce:add_mammut
gpucce
gpucce initial commits
8d2589f3
gpucce typos
4e218498
first run
2c967a91
t arg change
53cf4ed2
Merge remote-tracking branch 'upstream/main' into add_mammut
46a89a47
gpucce better generation logic
5827a10e
gpucce Merge remote-tracking branch 'upstream/main' into add_mammut
fcee7c23
gpucce adjust notation
997b7868
gpucce make as with cls at the end
852b2f47
transformer.py: MultimodalTransformer not using init_parameters
88f70baa
gpucce Merge branch 'fix_multimodal_transformer' of https://github.com/iejMa…
ad716d75
gpucce test CI
dcb18d72
gpucce move back ci
491bb5ce
gpucce split pooling
dce72e8c
gpucce Merge remote-tracking branch 'origin/double_att_pool' into fix_multim…
623226c3
gpucce Merge branch 'fix_multimodal_transformer' of https://github.com/iejMa…
ee982c91
gpucce Merge branch 'fix_multimodal_transformer' into add_mammut
e191018c
gpucce Merge branch 'main' into add_mammut
20dc72bc
gpucce fix padding and 0 loss
4b95dd48
gpucce Merge branch 'main' into add_mammut
1c386220
gpucce revert multiple attn pooler
826e7995
gpucce rm white space
b25d6f92
gpucce duplicated init_parameters
5a55161f
gpucce missing is_decoder
26f39c0f
gpucce small improvements
fe6bc7bd
gpucce move mage projection
0ea74acd
gpucce remove useless kwargs
e4258f11
gpucce remove useless kwargs
ddb647ce
gpucce refactor mammut into MultimodalTransformer
a3d32fd5
gpucce fix args
5a29467c
gpucce small improvements
805ad2d5
gpucce inherit init
46da15f7
gpucce make equal
53c6d392
gpucce fix typo
1d46cfa0
gpucce output latents and logits
2abdf120
gpucce add mammut L/14 config
4c8dcee9
gpucce allow text=None in forward
2ca74c10
gpucce better decoder
52e7b3e8
gpucce better decoder
4f0b8321
gpucce Merge branch 'main' into add_mammut
b980b57b
gpucce Merge branch 'main' into add_mammut
dc8d8413
gpucce integrate transformers generate changes
f13b230e
gpucce Merge branch 'main' into merge_main_mammut
29a052fb
gpucce stash pop
e83ac00f
gpucce update generation
717fe7ac
gpucce uniform generation
f64c9952
gpucce make equal
f3912dc5
gpucce add context length to mammut
c4bb1986
gpucce add older pooling
ef9071ba
gpucce Merge branch 'merge_main_mammut' into add_mammut
cb619dc2
gpucce Merge branch 'main' into add_mammut
1e14e4c5
gpucce typo and print
b34393ee
gpucce uniform everything
cc503674
JeniaJitsev
gpucce
mehdidc
gpucce
gpucce
mehdidc

Reviewers
No reviews
Assignees
No one assigned