FP6 quantization end-to-end. (#5234)
The user interface: https://github.com/microsoft/DeepSpeed-MII/pull/433
nv-a6000 ci running against the MII branch linked above is
[here](https://github.com/microsoft/DeepSpeed/actions/runs/8192124606)
Co-authored-by: Zhen Zheng
[zhengzhen@microsoft.com](mailto:zhengzhen@microsoft.com)
Co-authored-by: Shiyang Chen [csycfl@gmail.com](mailto:csycfl@gmail.com)
Co-authored-by: Arash Bakhtiari
[abakhtiari@microsoft.com](mailto:abakhtiari@microsoft.com)
Co-authored-by: Haojun Xia
[xhjustc@mail.ustc.edu.cn](mailto:xhjustc@mail.ustc.edu.cn)
---------
Co-authored-by: ZHENG, Zhen <zhengzhen.z@qq.com>
Co-authored-by: Shiyang Chen <csycfl@gmail.com>
Co-authored-by: Haojun Xia <xhjustc@mail.ustc.edu.cn>
Co-authored-by: Arash Bakhtiari <arash@bakhtiari.org>
Co-authored-by: Michael Wyatt <mrwyattii@gmail.com>
Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>