Pali gemma modeling #1895

Narsil merged 29 commits into main from pali-gemma-modeling
drbh
drbh feat: load and query model
5fd72ed0
drbh feat: improve config and refactor
b07b53ef
drbh fix: debugging
23294344
drbh fix: adjust siglip attention
e13c08f5
drbh fix: debug avoid scaling embed
d503007f
drbh fix: adjust image and text merge logic
36fb4b5a
drbh fix: typo and lint
4df1b25d
drbh fix: adjust inputs_embeds passed to language model and debug
6e8a2110
drbh fix: prefer gemma rotary embed and split attention weight
5b3b8fd7
drbh fix: small test tweak
9b9614ce
Narsil Don't break what's not broken.
ebbe7edc
Narsil Back functional gemma.
67e833ce
Narsil Fixed PaliGemma.
c119ac4d
drbh fix: apply paligemma template conditionally
d6e306c2
drbh fix: improve pali test and add snapshot
70713fc2
drbh fix: default add special tokens to avoid vlm regressions
17ac93ef
Narsil
Narsil commented on 2024-05-15
Narsil
Narsil commented on 2024-05-15
Narsil Working integration-tests.
65bc0aaa
Narsil Fixed.
1bcaf8f5
Narsil Small updates.
e8d02188
Narsil Installing git.
79b15feb
Narsil Revert "Installing git."
ec926013
Narsil Revert "Revert "Installing git.""
81e7aacb
Narsil Trying to understand the weird failure.
368c057c
Narsil Change the dockerfile. It builds locally, something might be up in AWS
dc0b8d76
Narsil DEbugging this nightmare.
f3f71401
Narsil Using updated runner.
f8337a9e
Narsil Another attempt.
fcb62c71
Narsil Sshing a cuda 12.4
90059707
Narsil Upgrade mamba.
7f97fdac
Narsil
Narsil approved these changes on 2024-05-16
Narsil Narsil merged 40213c95 into main 1 year ago
Narsil Narsil deleted the pali-gemma-modeling branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone