transformers
950cfb0b - Fix PaliGemma Pad Token Masking During Training #35855 (#35859)

Commit
304 days ago
* change order of unmasking of tokens
* library import
* class setup
* test function
* refactor
* add commit message
* test modified
* explicit initialisation of weights + made model smaller
* removed separate testing file
* fixup
* fixup core
* test attention mask with token types
* tests fixup
* removed PaliGemmaAttentionMaskTest class

---------

Co-authored-by: sambhavnoobcoder <indosambahv@gmail.com>