Add micro-benchmarks for Attention and SkipLayerNormalization ops. (#10798)
* Add choices for the provider and precision arguments.
* Automatically select the CUDA or ROCm execution provider.
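
For readers unfamiliar with the pattern, the argument choices and provider selection described above might look roughly like the sketch below. This is an illustration only, not the code from this change: the flag names, the choice values, the defaults, and the helper names `select_gpu_provider` and `build_parser` are all assumptions. Only the execution provider strings are the names ONNX Runtime itself uses.

```python
import argparse


def select_gpu_provider(available):
    """Return the first GPU execution provider present in `available`.

    Hypothetical helper: the preference order and the CPU fallback are
    assumptions made for this sketch.
    """
    for name in ("CUDAExecutionProvider", "ROCMExecutionProvider"):
        if name in available:
            return name
    return "CPUExecutionProvider"


def build_parser():
    """Build a parser whose options only accept a fixed set of values."""
    parser = argparse.ArgumentParser(description="Micro-benchmark sketch")
    # Hypothetical flags: leaving --provider unset means "pick automatically".
    parser.add_argument("--provider", choices=["cuda", "rocm"], default=None)
    parser.add_argument("--precision", choices=["fp16", "fp32"], default="fp16")
    return parser
```

In a real script the `available` list would typically come from `onnxruntime.get_available_providers()`, so the benchmark runs unchanged on either a CUDA or a ROCm build.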