DeepSpeed
a04480e1 - Fix the half-precision version of CPU-Adam (#2032)

Commit
3 years ago
Fix the half-precision version of CPU-Adam (#2032) * Fix the half-precision version of CPU-Adam * remove unexpected return * fix the increase width (fp32/fp16) * support fp16 tests for cpu-adam * fix the fp16 data-loading * change unit-test for fp16 check & slight change to parameter size * fix for numpy error Co-authored-by: Jeff Rasley <jerasley@microsoft.com> Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
Parents
  • csrc/includes
    • File
      cpu_adam.h
    • File
      simd.h
  • tests/unit
    • File
      test_cpu_adam.py