MP ZeRO++ #3954

HeyangQin merged 51 commits into master from HeyangQin/mixed_precision_lora
HeyangQin
HeyangQin zero++ tutorial PR (#3783)
df1859d6
zhiruiluo [Fix] _conv_flops_compute when padding is a str and stride=1 (#3169)
d81a6ad6
cli99 fix interpolate flops compute (#3782)
a8c182a4
CaffreyR use `Flops Profiler` to test `model.generate()` (#2515)
c4c442f0
jeffra revert PR #3611 (#3786)
fc9e1ee0
jeffra bump to 0.9.6
40045dc7
HeyangQin ZeRO++ chinese blog (#3793)
49a0a1bb
jeffra remove staging trigger (#3792)
2c62cb4c
stephen-youn DeepSpeed-Triton for Inference (#3748)
4dc65f7b
HeyangQin ZeRO++ (#3784)
e1119d8b
HeyangQin adding zero++ to navigation panel of deepspeed.ai (#3796)
01b843aa
tohtana Add ZeRO++ Japanese blog (#3797)
319b64ed
cli99 Bug Fixes for autotuner and flops profiler (#1880)
b4a2c0af
cmikeh2 Missing strided copy for gated MLP (#3788)
b7e1010b
jomayeri Requires grad checking. (#3789)
e5b1eadb
jeffra bump to 0.10.0
9c756cf8
rraminen Fix Bug in transform.cu (#3534)
a204edc7
stephen-youn bug fix: triton importing error (#3799)
f6e2e38b
jeffra Merge branch 'master' of github.com:microsoft/DeepSpeed
c1a7d3cc
jeffra Merge branch 'master' of github.com:microsoft/DeepSpeed
65ed5483
jeffra Merge branch 'master' of github.com:microsoft/DeepSpeed
d7ac3296
jeffra Merge branch 'master' of github.com:microsoft/DeepSpeed
83f1102e
jeffra Merge branch 'master' of github.com:microsoft/DeepSpeed
16555b27
jeffra Merge branch 'master' of github.com:microsoft/DeepSpeed
9d7b654a
HeyangQin init commit for mixed precision lora
2efb73dd
HeyangQin fix format
1147885c
HeyangQin patch _allgather_params & minor fixes
1bec51f5
HeyangQin make sure initial quantization are finished
5b3c460a
HeyangQin make sure dequantization is finished
ec1f154c
HeyangQin skip quantization for small parameters
9d531688
HeyangQin fix format
8fe8c87d
HeyangQin HeyangQin requested a review from jeffra jeffra 2 years ago
HeyangQin HeyangQin requested a review from tjruwase tjruwase 2 years ago
HeyangQin HeyangQin requested a review from samyam samyam 2 years ago
HeyangQin HeyangQin requested a review from mrwyattii mrwyattii 2 years ago
HeyangQin Merge branch 'master' into HeyangQin/mixed_precision_lora
cabf59c0
HeyangQin remove unused async_op
b3ad4253
HeyangQin Merge branch 'HeyangQin/mixed_precision_lora' of https://github.com/m…
7b2b6a4b
HeyangQin lazy load of quantizer kernels
a06c5644
HeyangQin add mixed precision lora tutorial
94cf3c4a
HeyangQin Merge branch 'master' into HeyangQin/mixed_precision_lora
ce96d9a0
HeyangQin cleanup mics
b1cb5973
HeyangQin cleanup mics
3470949c
HeyangQin Merge branch 'HeyangQin/mixed_precision_lora' of https://github.com/m…
e0e8cf49
HeyangQin replace get_accelerator().current_device()
c25cf6b0
HeyangQin Merge remote-tracking branch 'origin/master' into HeyangQin/mixed_pre…
aa4f28a2
HeyangQin Merge remote-tracking branch 'origin/master' into HeyangQin/mixed_pre…
f7cb5493
HeyangQin add kwargs to mics
d5013092
HeyangQin fix format
b5a41fab
HeyangQin HeyangQin changed the title Mixed precision LoRA release Mixed precision ZeRO++ release 2 years ago
HeyangQin HeyangQin changed the title Mixed precision ZeRO++ release MP ZeRO++ 2 years ago
HeyangQin seperate code and tutorial
74c27605
HeyangQin Merge branch 'master' into HeyangQin/mixed_precision_lora
9f68cdad
awan-10
awan-10 approved these changes on 2023-08-18
HeyangQin HeyangQin enabled auto-merge 2 years ago
awan-10 Merge branch 'master' into HeyangQin/mixed_precision_lora
f8020116
HeyangQin Merge branch 'master' into HeyangQin/mixed_precision_lora
a6bd4544
awan-10 Merge branch 'master' into HeyangQin/mixed_precision_lora
3d527b24
HeyangQin fix _all_gather in zero3
9e277ba8
HeyangQin HeyangQin merged 7711bdbb into master 2 years ago
jeffra jeffra deleted the HeyangQin/mixed_precision_lora branch 2 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone