Fix SigLIP casual mask bug (#25360)

Commit

254 days ago

Fix SigLIP casual mask bug (#25360) ### Description  SigLIP architecture inside the vision encoder should not use a causal mask on the attention. This change will fix Phi 4 MM accuracy issues we have seen. ### Motivation and Context  --------- Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

References

#25360 - Fix SigLIP casual mask bug

Author

nenad1002

Parents

1e5fdd12

onnxruntime f19bb3c7 - Fix SigLIP casual mask bug (#25360)

onnxruntime
f19bb3c7 - Fix SigLIP casual mask bug (#25360)