Add EmbedLayerNormalization and SkipLayerNormalization ops for bert optimization (#2012)
* Add Embed Layer Normalization and Skip Layer Normalization ops for bert optimization.
* add float16 test for skiplayernorm
* Add test for EmbedLayerNormalization op
* fix cpu build error
* fix build warning
* update HasCudaEnvironment function
* handle cuda error