DeepSpeed
Make the quantized data shape compatible with original tensor shape
#5483
Open

Make the quantized data shape compatible with original tensor shape #5483

sfc-gh-reyazda
sfc-gh-reyazda Make the quantized data shape compatible with original tensor shape
085fcd8b
sfc-gh-reyazda sfc-gh-reyazda requested a review from mrwyattii mrwyattii 1 year ago
sfc-gh-reyazda sfc-gh-reyazda requested a review from awan-10 awan-10 1 year ago
sfc-gh-reyazda sfc-gh-reyazda requested a review from arashb arashb 1 year ago
sfc-gh-reyazda
sfc-gh-reyazda
sfc-gh-reyazda change the scale and quantized data format
a83b384f
sfc-gh-reyazda minor fixes
048648dd
sfc-gh-reyazda fix
bf128934
sfc-gh-reyazda minor fix
b18f71f6
sfc-gh-reyazda Merge branch 'master' into fix-quantized-shape
4d6e04ba
sfc-gh-reyazda more fixed
f9244558
sfc-gh-reyazda Merge branch 'fix-quantized-shape' of https://github.com/Snowflake-La…
e03c0f49
nelyahu Improve _configure_optimizer() final optimizer log (#5528)
d9cfba6e
vshekhawat-hlab Enhance testing: Skip fused_optimizer tests if not supported. (#5159)
2bbc6806
foin6 Skip the UT cases that use unimplemented op builders. (#5372)
b3ab6265
rraminen rocblas -> hipblas changes for ROCm (#5401)
4494c86c
rraminen Rocm warp size fix (#5402)
2c0dcac5
deepcharm Optimize zero3 fetch params using all_reduce (#5420)
f53895fe
BacharL CPUAdam fp16 and bf16 support (#5409)
bb146c33
shiyang-weng Fix the TypeError for XPU Accelerator (#5531)
31f11c05
shiyang-weng Fix RuntimeError for moe on XPU: tensors found at least two devices (…
35b48135
BacharL Remove synchronize calls from allgather params (#5516)
cf0ccb5a
deepcharm Avoid overwrite of compiled module wrapper attributes (#5549)
e388056c
TravelLeraLone Small typos in functions set_none_gradients_to_zero (#5557)
5ff0d446
oraluben Adapt doc for #4405 (#5552)
29ab009b
loadams Update to HF_HOME from TRANSFORMERS_CACHE (#4816)
633da3d9
oelayan7 [INF] DSAttention allow input_mask to have false as value (#5546)
9db010e5
deepcharm Add throughput timer configuration (#5363)
bd2b2ef1
Kwen-Chen Add Ulysses DistributedAttention compatibility (#5525)
3c5aa00a
lekurile Add hybrid_engine.py as path to trigger the DS-Chat GH workflow (#5562)
d7f9be61
loadams Update HPU docker version (#5566)
c160d76a
ys950902 [MiCS] Remove the handle print on DeepSpeed side (#5574)
c203830f
loadams Rename files in fp_quantize op from quantize.* to fp_quantize.* (#5577)
5e5c8a7c
loadams Update to fix sidebar over text (#5567)
ff01ade2
nelyahu DeepSpeedCheckpoint: support custom final ln idx (#5506)
83920f6f
adk9 Update minor CUDA version compatibility (#5591)
a6076cf1
tohtana Add slide deck for meetup in Japan (#5598)
9db99709
costin-eseanu Fixed the Windows build. (#5596)
c6f151c1
nelyahu estimate_zero2_model_states_mem_needs: fixing memory estiamtion (#5099)
0bf35115
Liangliang-Ma Fix cuda hardcode for inference woq (#5565)
cca53b0b
inkcherry fix sequence parallel(Ulysses) grad scale for zero0 (#5555)
31815d9c
Liangliang-Ma Add Compressedbackend for Onebit optimizers (#5473)
6ad125e4
vshekhawat-hlab Updated hpu-gaudi2 tests content. (#5622)
9c15b8f7
loadams Pin transformers version for MII tests (#5629)
2e4bc1d3
NirSonnenschein WA for Torch-compile-Z3-act-apt accuracy issue from the Pytorch repo …
e5b4d414
nelyahu stage_1_and_2: optimize clip calculation to use clamp (#5632)
8a4d03c7
penn513 Fix overlap communication of ZeRO stage 1 and 2 (#5606)
5e5b1f7b
sfc-gh-reyazda Merge branch 'master' of https://github.com/Snowflake-Labs/deepspeed …
c47ad5f9
sfc-gh-reyazda remove float8 dtype
277902a1
sfc-gh-reyazda Merge branch 'master' into fix-quantized-shape
74311af7
loadams loadams removed review request from arashb arashb 260 days ago
loadams loadams removed review request from mrwyattii mrwyattii 260 days ago
loadams loadams requested a review from jeffra jeffra 260 days ago
loadams loadams requested a review from tjruwase tjruwase 260 days ago
loadams loadams requested a review from hwchen2017 hwchen2017 260 days ago
loadams loadams removed review request from awan-10 awan-10 260 days ago
sfc-gh-reyazda Merge branch 'master' into fix-quantized-shape
9eb12fbe
sfc-gh-reyazda sfc-gh-reyazda requested a review from tohtana tohtana 258 days ago
sfc-gh-reyazda sfc-gh-reyazda requested a review from loadams loadams 258 days ago
hwchen2017
hwchen2017 commented on 2025-01-13
hwchen2017
hwchen2017 commented on 2025-01-13
hwchen2017
hwchen2017 approved these changes on 2025-01-13

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone