Enable universal checkpoint for ZeRO stage 1 (#4516)
* Enable uni_ckpt for z1
* Move logging fix to a separate PR. Relocate conversion script to avoid a logging circular import
* Formatting fix
* PR feedback
* Handle replicated params
* Detect bf16_optimizer
* Docs
* Fix docs