[zero] faster flatten/unflatten (cpp version) (#910)
* faster flatten/unflatten with apex
* switch to cpp flatten/unflatten
* style
* better comment
* missing import
* switch to build ops at run time
* fixes
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>