autoTP for fused qkv weight (#3844)
* autoTP for fused qkv weight
* fix format
* clean up
* clean up
* clean up
* update
* make logic flow to util and move to file
* fix formatting
* remove empty line
---------
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Reza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>