[DirectML] Broadcast NC-dims for Tensors A&B in DynamicQuantizeMatMul (#21298)
### Description
[DirectML] Broadcast NC-dims for Tensors A&B in DynamicQuantizeMatMul
The DynamicQuantizeMatMul allows input tensors in NCHW format, and
DirectML requires that input tensors share the same batch and channel
dimensions. Tensors A and B should be broadcast (if possible) to the
corresponding output NC dims.
### Motivation and Context
Certain models which use DynamicQuantizeMatMul hit a crash when the NC
dims are intended to be broadcast.
---------
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>