DeepSpeed
Add `pyproject.toml` with legacy build backend to keep most logic in `setup.py`
#7033
Open

loadams wants to merge 76 commits into master from loadams/pyproject-toml
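The change this PR describes is small on its face: a `pyproject.toml` is added that points PEP 517 front-ends at setuptools' legacy backend, so the existing `setup.py` keeps driving the build (custom op builders, accelerator detection, and so on). The sketch below is a minimal plausible form of such a file; the exact `requires` list and version pins are assumptions, not the contents merged by this PR.

```toml
[build-system]
# Version floor shown here is illustrative; the PR bumps the setuptools
# minimum separately ("Update setuptools min requirement").
requires = ["setuptools>=61.0", "wheel"]
# The legacy backend runs setup.py, which is what keeps the existing
# build logic in setup.py working unchanged.
build-backend = "setuptools.build_meta:__legacy__"
```

`setuptools.build_meta:__legacy__` is the backend pip falls back to when a project has no `pyproject.toml`, so declaring it explicitly preserves the current `setup.py`-centric behavior while still giving the project a standard build-system declaration.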
stas00 commented on 2025-02-14
loadams changed the title from Add pyproject.toml to Add `pyproject.toml` with legacy build backend to keep most logic in `setup.py` 317 days ago
loadams marked this pull request as ready for review 311 days ago
loadams requested a review from jeffra 311 days ago
loadams requested a review from sfc-gh-mwyatt 311 days ago
loadams requested a review from tjruwase 311 days ago
rraminen [ROCm] Enable fp_quantizer on ROCm (#7027)
230f479e
loadams Add pyproject.toml
130c11c4
loadams Formatting fix
b2e16cbc
GuanhuaWang add gds chinese blog (#7034)
72bfb702
hwchen2017 Add chinese blog for deepspeed windows, and fix format (#7035)
48c02ad4
jomayeri AIO on ROCM (#7023)
4996cae5
loadams Update setuptools min requirement
52709fa2
loadams Switch build to legacy
9eb0618b
loadams Add no-build isolation
36ce3737
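The "Add no-build isolation" commit above reflects a practical consequence of adopting PEP 517 builds: DeepSpeed's `setup.py` imports `torch` at build time, so an isolated build environment would not see the torch already installed in the CI job. A hedged sketch of the kind of install invocation this implies (the prerequisite step is illustrative):

```sh
# torch (and the build requirements from pyproject.toml) must already be
# present, because --no-build-isolation skips creating an isolated env.
pip install torch setuptools wheel

# Build in the current environment so setup.py can import torch and
# probe for CUDA/ROCm when compiling DeepSpeed's extensions.
pip install . --no-build-isolation
```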
tjruwase Control trace cache warnings (#7039)
a0ff11a4
hwchen2017 Update CUDA compute capability to support Blackwell (#7047)
c574b454
loadams Update setup.py handling of ROCm cupy (#7051)
5326873f
loadams nv-ds-chat breaks with latest transformers (#7052)
c22be1ab
loadams Test with non legacy backend
2873a11f
loadams Need legacy backend to execute setup.py, though it executes it differ…
4e884634
loadams Update to actually use legacy backend
6d837ca1
tjruwase Rename aio_thread_count to intra_op_parallelism (#7056)
5b2f7135
inkcherry add autoTP training zero2 tests (#7049)
c0f4235d
wukong1992 Fix, bf16 optimizer remove dup loop (#7054)
01dd3187
loadams Update version.txt after 0.16.4 release (#7063)
41718ada
stas00 fix an outdated doc wrt CUDA_VISIBLE_DEVICES (#7058)
c83ade6f
siqi654321 Tecorigin sdaa accelerator (#6903)
cc79d173
loadams Handle special case of libuv for Windows (#7064)
aa63ba4f
loadams Update README with info on newest accelerator (#7065)
aaf86d68
U-rara Bug Fix for offload_states API (#7050)
630864d3
loadams Add all imports from setup.py to pyproject.toml
5a2eceae
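The commit above ("Add all imports from setup.py to pyproject.toml") suggests that, at this point in the branch, the build-system `requires` list was expanded to cover the packages `setup.py` imports so they are available inside an isolated build environment. The snippet below is only an assumed intermediate shape; the concrete package list is not visible from this page, and a later commit ("Remove unneeded requires in build system declaration") trims it back down.

```toml
[build-system]
# Illustrative only: the packages named here stand in for whatever
# setup.py actually imports; they are not taken from the PR diff.
requires = ["setuptools", "wheel", "torch", "ninja"]
build-backend = "setuptools.build_meta:__legacy__"
```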
loadams Fix TOCTOU issues, switch to fstat (#7067)
ac52baec
ShellyNR config torch to avoid graph breaks caused by logger (#6999)
fd5273f3
Yejing-Lai Fix meta load tensor imcompatible issue (#7073)
c91d2256
loadams Replace calls to `python setup.py sdist` with `python -m build --sdis…
80ff4201
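The truncated commit message above swaps direct `setup.py` invocation for the standard build front-end, which reads `pyproject.toml` and builds the sdist through the declared backend. Roughly, the workflow command changes like this (the pre-install of `build` is shown for completeness):

```sh
# Before: invoking setup.py directly (deprecated by modern setuptools)
python setup.py sdist

# After: the pypa/build front-end drives the build via pyproject.toml
pip install build
python -m build --sdist
```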
loadams Update workflow to not do build isolation
355e9c5e
loadams Add more requirements
00addaa7
loadams Revert "Handle special case of libuv for Windows (#7064)" (#7076)
432e71dc
Yejing-Lai Add DeepseekV3 AutoTP. (#7045)
5ab85893
loadams Add pyproject.toml into comments
f325c92a
loadams Improve inference tutorial docs (#7083)
9c4c4447
loadams Pin transformers version on tests that use latest. (#7085)
5ebe9bdc
siddharth9820 Update README.md with ICS '23 MoE paper link (#7087)
9da1b336
loadams Update parallelism for nv-torch-latest/nightly tests due to more GPUs…
f9632484
loadams Remove workflows for very old torch versions (#7090)
8254e1b3
tjruwase Use new dlpack api; Formatting fixes (#7101)
a7ab4ab7
deepcharm Avoid graph breaks by disabling sourceless calls in instrument_w_nvtx…
6eb5baa7
deepcharm Avoid graph breaks in torch.compile caused by inner classes in the ba…
feeaa2e4
hwchen2017 Only run pre-commit on the changes (#7106)
7b7e1122
deepcharm Avoid graph break due to unsupported frozenset (#7105)
674f9bd5
Yejing-Lai Fix fused_qkv print model ValueError (#7109)
240c2a7b
loadams Update references to new X/Twitter handle (#7110)
1c35241b
raza-sikander Update gaudi2 nightly,ci to latest 1.20.0 build (#7093)
ec2e93e3
inkcherry fix keep_module_on_host (#7112)
c7934014
loadams Add sequential pytest mark to TestNVMeCheckpointing to resolve pytest…
62abd88e
tjruwase Training multiple models (#7018)
065084be
loadams Update CONTRIBUTING.md to reflect changes from CLA to DCO (#7135)
e93f0d16
tjruwase Avoid missing attr error (#7133)
eaeae790
A-transformer Add conditional expression (#7119)
8df6245e
loadams Unpin transformers version for most workflows (#7139)
6de20f65
saurabhkoshatwar Conditionally quote env vars (#7071)
e0188ac3
A-transformer Correct the BACKWARD_PREFETCH_SUBMIT mismatch (#7120)
ea61c6ca
raza-sikander Enhance Gaudi2 CI/Nightly Coverage with Model Parallelism and Linear …
9d753116
loadams Update container version that runs on A6000 tests. (#7153)
4ca7ba57
fix leak of z3 buffer
2db922fe
inkcherry hf tp+zero training doc. (#7151)
78ec0254
deepcharm Avoid graph break by removing redundant requires_grad attr change (#7…
4a851f35
tohtana Add destroy to tests to free memory (#7160)
6adcf2e3
c8ef [NFC] Typo fix in SP layer. (#7152)
7b7ac9e3
hwchen2017 Link AutoTP blog in the front page (#7167)
31ec2b77
loadams force pushed from 705edb34 to 31ec2b77 283 days ago
loadams requested a review from tohtana 283 days ago
loadams requested a review from jomayeri 283 days ago
loadams requested a review from hwchen2017 283 days ago
loadams Merge branch 'master' into loadams/pyproject-toml
e40df22d
loadams Remove unneeded requires in build system declaration
4c32a9d0
sfc-gh-mwyatt approved these changes on 2025-03-25
loadams Add build to the pyproject
c4b24fd3
loadams Add no torch build triggers
b330b4e5
loadams Remove no build isolation from nv-torch-latest
3601c293
loadams Merge branch 'master' into loadams/pyproject-toml
e0d9ba44
loadams Merge branch 'master' into loadams/pyproject-toml
42e42a08
loadams Merge branch 'master' into loadams/pyproject-toml
3c94f513
agronholm commented on 2025-04-09
loadams Merge branch 'master' into loadams/pyproject-toml
221f2be6
loadams Merge branch 'master' into loadams/pyproject-toml
dfcc24d3
loadams Merge branch 'master' into loadams/pyproject-toml
b4ed94a6
