DeepSpeed
Enable torch.compile with ZeRO (Experimental)
#4878
Merged

Enable torch.compile with ZeRO (Experimental) #4878

tohtana merged 49 commits into master from tohtana/compile-zero
tohtana
add option to run torch.compile
91c01ab6
improve compile helper
49c7acd1
fix compile wrapper to make modules() work
bbeb38cf
add torch.compiler-disable to comm module
719cc79d
move options for torch.compile to ds config
3ea9b444
Merge branch 'master' into tohtana/compile-zero
eb9d4e06
rename module and wrap decorator
fbae0278
fix validation of compile config
4f8f86d6
avoid reference to torch._dynamo when torch has no support
d83963bd
fix custom backend for test
6920ab60
fix validation
c3429a68
tjruwase
stas00
stas00
stas00
stas00
stas00
tohtana
stas00
tohtana
stas00
refactor config for torch.compile
bfafb886
tohtana Merge branch 'master' into tohtana/compile-zero
3c13fd41
tohtana Merge branch 'master' into tohtana/compile-zero
9e63c956
fix validation of compiler config
ff9c1efb
fix access to wrapped model
26b7f254
add test for api to set torch compile options
48d2453f
rename util module
d5584b00
fix import
b9157ac5
tohtana
tohtana tohtana marked this pull request as ready for review 1 year ago
tohtana tohtana requested a review from tjruwase tjruwase 1 year ago
tohtana tohtana requested a review from mrwyattii mrwyattii 1 year ago
tohtana tohtana requested a review from awan-10 awan-10 1 year ago
tohtana tohtana requested a review from loadams loadams 1 year ago
tohtana Merge branch 'master' into tohtana/compile-zero
c19bf97a
delay reduce-scatter for z3 leaf modules
93268f30
tohtana Merge branch 'master' into tohtana/z3_moe_bwd
a56ffec0
stas00
add comment to config class
2a5e741d
tohtana Merge branch 'master' into tohtana/compile-zero
ec919251
add api to get leaf modules
ca5cff6a
wconstab
tohtana
wconstab
tohtana
tjruwase Merge branch 'master' into tohtana/compile-zero
f6151380
tjruwase Merge branch 'master' into tohtana/z3_moe_bwd
08770b8c
wconstab
tohtana
add api to set a function to run torch.compile
3e5658be
tohtana
mrwyattii Merge branch 'master' into tohtana/compile-zero
5d9992e2
tjruwase
tjruwase commented on 2024-01-31
tjruwase
tjruwase commented on 2024-01-31
tjruwase
tjruwase commented on 2024-01-31
tjruwase
tjruwase commented on 2024-01-31
tjruwase
tjruwase commented on 2024-01-31
tjruwase
refactor compile config
a3c0e5d3
lift is_compile_supported up to use as `deepspeed.is_compile_supporte…
95f4f344
avoid overwriting backend fn in validator
1932b787
add tests combining compile and zero
ca856052
rename test modules
19dd4548
tohtana Merge branch 'master' into tohtana/z3_moe_bwd
fccbd950
tohtana Merge branch 'master' into tohtana/z3_moe_bwd
da1f41d3
use no zero + no compile as baseline for tests
d8c0a14c
disable memory_efficient_linear when torch.compile is enabled
ca419b4d
pass only tensors to z3 hooks to prevent dynamo from displaying errors
5e3a0707
Merge branch 'tohtana/z3_moe_bwd' into tohtana/compile-zero
06379688
Merge branch 'master' into tohtana/compile-zero
7131d6e5
fix exception used in test
96c8647f
increse tolerance in tests
0b3dae9b
add check for bf16
94cc97a5
enable accelerator check for bf16
eb27b9d3
tohtana Merge branch 'master' into tohtana/compile-zero
c1e10e5e
tjruwase
tjruwase approved these changes on 2024-02-05
mrwyattii update DistributedTest to work with torch.compile tests
c2ba829d
mrwyattii remove unused global
82f80d12
mrwyattii
mrwyattii approved these changes on 2024-02-05
tohtana Merge branch 'master' into tohtana/compile-zero
cb540d35
tohtana tohtana enabled auto-merge 1 year ago
tohtana tohtana merged c3cfe96b into master 1 year ago
oraluben
oraluben commented on 2024-02-18

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone