Add blog for ZenFlow (#7463)
This PR adds a blog post and images for ZenFlow, introducing its design,
benefits, and usage. The blog explains how ZenFlow improves GPU
utilization by overlapping computation and communication during
offloaded training.
See also:
#7391 – core ZenFlow implementation.
[#982](https://github.com/deepspeedai/DeepSpeedExamples/pull/982) - –
benchmarking and fine-tuning example.
---------
Signed-off-by: Tingfeng Lan <erc8gx@virginia.edu>
Co-authored-by: Olatunji Ruwase <tunji.ruwase@snowflake.com>
Co-authored-by: Hongwei Chen <33092912+hwchen2017@users.noreply.github.com>