DeepSpeed
ccfdb84e - FP6 quantization end-to-end. (#5234)

Commit
1 year ago
FP6 quantization end-to-end. (#5234) The user interface: https://github.com/microsoft/DeepSpeed-MII/pull/433 nv-a6000 ci running against the MII branch linked above is [here](https://github.com/microsoft/DeepSpeed/actions/runs/8192124606) Co-authored-by: Zhen Zheng [zhengzhen@microsoft.com](mailto:zhengzhen@microsoft.com) Co-authored-by: Shiyang Chen [csycfl@gmail.com](mailto:csycfl@gmail.com) Co-authored-by: Arash Bakhtiari [abakhtiari@microsoft.com](mailto:abakhtiari@microsoft.com) Co-authored-by: Haojun Xia [xhjustc@mail.ustc.edu.cn](mailto:xhjustc@mail.ustc.edu.cn) --------- Co-authored-by: ZHENG, Zhen <zhengzhen.z@qq.com> Co-authored-by: Shiyang Chen <csycfl@gmail.com> Co-authored-by: Haojun Xia <xhjustc@mail.ustc.edu.cn> Co-authored-by: Arash Bakhtiari <arash@bakhtiari.org> Co-authored-by: Michael Wyatt <mrwyattii@gmail.com> Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
Author
Parents
Loading