[zero_to_fp32] adapt to 4-bytes alignment in z2 #1372
[zero_to_fp32] adapt to 4-bytes alignment in z2
d8077876
align both sides
a0080589
cleanup
a3bc8d98
adjust the existing test to reproduce the bug
08b41a8d
test only on one gpu
a38e3ae8
handle the edge case of param with 1 element
9bee134f
Merge branch 'master' into zero_to_fp32_z2_align4
aa049500
Merge branch 'master' into zero_to_fp32_z2_align4
3dc0083c
remove the buffer
592a7d79
Merge branch 'zero_to_fp32_z2_align4' of github.com:stas00/DeepSpeed …
7ca1c7f8
tjruwase
approved these changes
on 2021-09-16
jeffra
approved these changes
on 2021-09-16
jeffra
merged
30537e71
into master 4 years ago
stas00
deleted the zero_to_fp32_z2_align4 branch 4 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub