transformers
950cfb0b - Fix PaliGemma Pad Token Masking During Training #35855 (#35859)

Commit

1 year ago

Fix PaliGemma Pad Token Masking During Training #35855 (#35859) * change order of unmasking of tokens * library import * class setup * test function * refactor * add commit message * test modified * explict initiliasation of weights + made model smaller * removed sepete testing file * fixup * fixup core * test attention mask with token types * tests fixup * removed PaliGemmaAttentionMaskTest class --------- Co-authored-by: sambhavnoobcoder <indosambahv@gmail.com>

References

#35859 - Fix PaliGemma Pad Token Masking During Training #35855

Author

sambhavnoobcoder

Parents

1614d196

transformers 950cfb0b - Fix PaliGemma Pad Token Masking During Training #35855 (#35859)

transformers
950cfb0b - Fix PaliGemma Pad Token Masking During Training #35855 (#35859)