DeepSpeed
A faster and more memory-efficient implementation of `zero_to_fp32`
#6658
Merged


loadams merged 12 commits into deepspeedai:master from xu-song:patch-4
xu-song requested a review from tjruwase 1 year ago
xu-song requested a review from awan-10 1 year ago
tjruwase commented on 2024-10-23
tjruwase commented on 2024-10-23
tjruwase commented on 2024-10-23
tjruwase commented on 2024-10-23
tjruwase commented on 2024-10-23
tjruwase removed review request from awan-10 1 year ago
tjruwase requested a review from tohtana 1 year ago
xu-song closed this 1 year ago
xu-song reopened this 1 year ago
xu-song Faster and more memory-efficient impl of zero_to_fp32
ec734974
xu-song force pushed from e1d12bb6 to ec734974 1 year ago
tjruwase commented on 2024-10-28
tjruwase Merge branch 'master' into patch-4
27e8ae27
xu-song add unit test
e0f1d53f
xu-song requested a review from loadams 1 year ago
xu-song add comments
4d005b8a
xu-song fix yapf formatting issue
5684be04
loadams Merge branch 'master' into patch-4
f3f8b271
tjruwase approved these changes on 2024-11-06
loadams Merge branch 'master' into patch-4
b8c9a5e4
xu-song rename lazy_mege to lazy_mode; fix Trailing Whitespace
d2b98b03
tjruwase Merge branch 'master' into patch-4
f36925e9
tjruwase Merge branch 'master' into patch-4
5a753b97
loadams Merge branch 'master' into patch-4
af82bae8
loadams Merge branch 'master' into patch-4
517210cd
loadams enabled auto-merge 1 year ago
loadams merged dd402694 into master 1 year ago