[new model] Add zaya1 vl #46011
zaya1 support
b35c5e08
add test
d26fffc9
update example
8191d397
new config
c125ef31
remove empty line
c90df6f3
pass ci
b90759f1
modify config, laguna-style rope
7e299992
use existing cache
cf083aa1
cca refine + use llama attn
69d09f3f
use dict for 2d/4d mask
d936d54a
optimize, reuse existing code
733e687c
inherit from AfmoeForCausalLM
eb7c8cc7
checkpoint conversion
4d5bda4e
align with official implementation, check 74b conversion
f3e8e02c
easier test
f4f206c5
Merge branch 'main' into add_zaya1
059912d6
remove mapping since we convert the ckpt
7c48ee10
use default_swa_theta
498c2522
update date
3d630612
temp init
4d742969
modular
d77d5d47
better residual scaling
1c16fecb
better cache
3f53fbca
oops, forgot init again
dc7ac50d
better naming
8be4b1ee
llama decoderlayer
7bb5122a
improve
b315ae07
update date
0df3204d
Add ZAYA1-VL model support
b53060fb
improve
96c275f0
JJJYmmm marked this pull request as draft 36 days ago