[tests] expand flex-attn test for vision models (#38434)
* expand the test for VLMs
* typo
* mark models `supports_flex` + expand test for additional kwargs
* flex attn for refactored vision models
* fix copies
* fix
* unskip
* style
* address comments