Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
microsoft/DeepSpeed
Pull Requests
Commits
Open
Closed
move CPU_Accelerator --> Xeon_Accelerator
#5126 opened 2024-02-13 18:25 by
mrwyattii
Add HIP device abstraction, update Triton skip logic
#5120 opened 2024-02-12 20:42 by
lekurile
TEST: PR HIP-ifying and running the bias_activations kernel on AMD
#5082 opened 2024-02-05 18:52 by
lekurile
Workflow for AutoTP
#4961 opened 2024-01-16 10:05 by
delock
Support Triton 2.2+
#4937 opened 2024-01-11 17:45 by
loadams
Add Cache to Comm Group
#4849 opened 2023-12-20 22:18 by
cmikeh2
Support FP16 CpuAdam + Zero Stage 3
#4771 opened 2023-12-04 21:39 by
lz1oceani
support autoTP with weight only quantization in DS inference path
#4750 opened 2023-11-29 05:54 by
ftian1
SP Comm-optimization: fuse query, key, and value all-2-all for better SP perforamnce
#4735 opened 2023-11-28 00:45 by
RezaYazdaniAminabadi
Add simple layout for creating multi-dimensional parallelism
#4706 opened 2023-11-20 17:00 by
RezaYazdaniAminabadi
Add more weight only quantization algorithms into DeepSpeed inference.
#4577 opened 2023-10-27 06:00 by
ftian1
Fixed bug with hybrid engine generation when inference_tp_size > 1
#4493 opened 2023-10-10 07:55 by
hxdtest
Fix assert on Lamb optimizers with BF16
#4451 opened 2023-10-04 17:50 by
loadams
Destroy ZeRO
#4383 opened 2023-09-21 21:35 by
jomayeri
DS-Inference Quantization refresh: Fix several issues and add more features
#4351 opened 2023-09-17 18:43 by
RezaYazdaniAminabadi
Switch modeling to use transformers and torch version for BERT
#4329 opened 2023-09-13 23:28 by
loadams
Remove symlinks
#4323 opened 2023-09-13 18:41 by
mrwyattii
Refactor the injection to accept policy when using kernels
#4267 opened 2023-09-05 17:38 by
RezaYazdaniAminabadi
fix: fixed the communication problem of pp when using sequence parallel
#4228 opened 2023-08-28 07:53 by
LiuXTao
Allow TiedLayerSpec to have multiple tied weights
#4216 opened 2023-08-24 20:14 by
zphang
Fix for lm head weights for models such as llama
#4088 opened 2023-08-04 07:50 by
puneeshkhanna
llama with HE
#4087 opened 2023-08-04 03:15 by
ciayomin
fix bloom bugs about inference and TP
#3973 opened 2023-07-17 14:01 by
dawson-chen
Fix the call to get_param_coordinator() in _end_of_forward_hook()
#3972 opened 2023-07-17 12:22 by
mmhab
autoTP for HE
#3957 opened 2023-07-13 23:30 by
molly-smith
Pipe Engine Reduce High Dimension Output tensor fix
#3774 opened 2023-06-20 19:46 by
abhilash1910
Update vae.py
#3761 opened 2023-06-16 15:16 by
mzamini92
Adding assertion for mp_group in HE.
#3740 opened 2023-06-12 17:44 by
jomayeri
Addressing ipg Buffer Data Race Condition in Zero Stage2
#3727 opened 2023-06-09 11:17 by
xxr3376
Asymmetric quant algorithm update
#3696 opened 2023-06-06 21:48 by
cmikeh2
Newer
Older