microsoft/DeepSpeed
Pull Requests
Open
Create COMMITTERS_RESPONSIBILITY.md
#7300 opened 2025-05-21 14:25 by
PKUWZP
HF2UCP: Converting a `pytorch_model.bin` or `.safetensors` checkpoint to UCP
#7212 opened 2025-04-10 10:13 by
Schwidola0607
gather output layout support for column parallel
#7181 opened 2025-03-28 03:18 by
inkcherry
[bugfix] update results of state_dict loading, embedding resizing to secondary partitions (hpz)
#7130 opened 2025-03-11 08:54 by
cyr0930
[Draft] Add support for seq split in Domino
#7111 opened 2025-03-04 21:19 by
duanhx1037
Update Domino for Llama3
#7084 opened 2025-02-26 20:08 by
shenzheyu
Fix, pipeline model with moe cause error when send grad
#7055 opened 2025-02-19 11:53 by
wukong1992
Add `pyproject.toml` with legacy build backend to keep most logic in `setup.py`
#7033 opened 2025-02-13 18:10 by
loadams
Enabled high-performance Automatic Tensor Parallelism (auto TP) for the MoE models on multiple GPUs/HPUs
#6964 opened 2025-01-21 08:18 by
gyou2021
[FPDT] Support FPDT Based on Intel Backend
#6956 opened 2025-01-16 08:38 by
YizhouZ
Update sharded_moe.py to support top2 gate with Tutel
#6948 opened 2025-01-14 20:11 by
xenshinu
Fix: forbid repeated deepspeed.initialize on training objects
#6874 opened 2024-12-16 00:18 by
traincheck-team
Training ops kernels: Speeding up the Llama-based MoE architectures
#6734 opened 2024-11-08 23:21 by
RezaYazdaniAminabadi
Update MII tests to support transformers latest
#6686 opened 2024-10-29 17:27 by
loadams
Support the parallel conversion from ZeRO checkpoints to FP32/FP16/BF16 param weight
#6655 opened 2024-10-23 03:51 by
xylian86
modify_load_save_model
#6626 opened 2024-10-15 03:22 by
ssklzx
Improve consistency of zero_grad
#6554 opened 2024-09-18 20:27 by
tohtana
Unpin tests that previously used a pinned version of transformers
#6387 opened 2024-08-20 21:16 by
loadams
Hybrid Offloading for ZeRO3
#5625 opened 2024-06-07 01:45 by
tohtana
Fix deadlock in PipeEngine._exec_recv_grads
#5518 opened 2024-05-10 02:45 by
i4never
Make the quantized data shape compatible with original tensor shape
#5483 opened 2024-04-30 05:05 by
sfc-gh-reyazda
uniform deepspeed overflow check
#5424 opened 2024-04-16 22:40 by
GuanhuaWang
Adding DS Feature API in accelerator
#5423 opened 2024-04-16 20:54 by
duli2012
Update names of CPU Adam/Adagrad/Lion params to better match torch/GPU ops.
#5382 opened 2024-04-08 21:56 by
loadams
Disable compile for Z3 hook function
#5325 opened 2024-03-28 00:38 by
tohtana
Disable torch.nn.init when counting parameters in initializing PipelineModule
#5258 opened 2024-03-12 06:07 by
tanconghui
Set tp to 1 when MPU is None for bf16 optimizer
#5245 opened 2024-03-08 23:42 by
samadejacobs
apply reduce_scatter_coalesced op
#5224 opened 2024-03-04 12:50 by
inkcherry
Loadams/cpu inf v0 docker
#5137 opened 2024-02-15 18:51 by
loadams
move CPU_Accelerator --> Xeon_Accelerator
#5126 opened 2024-02-13 18:25 by
mrwyattii