DeepSpeed
[AutoTP] Make AutoTP work when num_heads not divisible by number of workers
#4011
Merged


delock
delock allow number of heads not divisible by number of ranks
0706acd7
delock get num_heads from model config, more robust
0bf785f8
delock simplify logic where num_head itself is sharded
72b9e1ad
delock requested a review from RezaYazdaniAminabadi 2 years ago
delock requested a review from jeffra 2 years ago
delock requested a review from mrwyattii 2 years ago
delock requested a review from awan-10 2 years ago
delock requested a review from cmikeh2 2 years ago
delock requested a review from arashb 2 years ago
delock name tweaks
5ed9a566
delock make code more robust where num_attention_heads may not be defined in…
73f499d9
delock Merge branch 'master' into gma/uneven_heads
48322c7b
delock Merge branch 'master' into gma/uneven_heads
f14e2909
mrwyattii requested changes on 2023-07-24
loadams Merge branch 'master' into gma/uneven_heads
b62317cb
delock support num_key_value_heads < num_attention_heads which is used by ll…
12c06286
delock add test for 5 ranks
8f23d9bf
delock requested a review from tjruwase 2 years ago
delock change odd rank # to 3 to avoid test skip
9c53bd74
tjruwase Merge branch 'master' into gma/uneven_heads
413224b2
molly-smith requested a review from molly-smith 2 years ago
delock Merge branch 'master' into gma/uneven_heads
78d6667a
delock add get_shard_size function
27fde308
delock modify sharding mechanism according to latest auto TP
8e1fd277
delock Merge branch 'master' into gma/uneven_heads
9a6bc12f
delock fix accuracy issue
2dac94f3
delock Merge branch 'master' into gma/uneven_heads
885f6a34
delock requested a review from mrwyattii 2 years ago
molly-smith Merge branch 'master' into gma/uneven_heads
7ffd8112
molly-smith assigned molly-smith 2 years ago
tjruwase Merge branch 'master' into gma/uneven_heads
40659baf
delock fix format
71f9f401
delock skip tests with fusedqkv
db9db6bf
delock Merge branch 'master' into gma/uneven_heads
72531c06
delock remove skip of fusedqkv tests
9d5eae34
delock skip test fusedqkv with odd number of ranks
25e656db
delock support model with n_heads in model_config
7f6d7f6d
molly-smith Merge branch 'master' into gma/uneven_heads
e3a5b771
delock Merge branch 'master' into gma/uneven_heads
c9ec881a
delock fix TestInjectionPolicy::test[fp32-t5]
f5be2571
inkcherry fix uneven_heads on some fusedqkv types (#12)
b6710401
dc3671 commented on 2023-08-30
delock better fix when activation size cannot be divided by number of heads
d59ff222
delock Merge branch 'master' into gma/uneven_heads_rebase
6c3c841d
molly-smith approved these changes on 2023-09-01
molly-smith Merge branch 'master' into gma/uneven_heads
58e8b24a
delock move tp_shard.py under module_inject
4c6b7fa2
delock Merge branch 'master' into gma/uneven_heads
18e1c5d5
delock Add get_num_kv_heads in tp_shard.py
8ef01e28
delock Merge branch 'master' into gma/uneven_heads
9a61fc26
delock Merge branch 'master' into gma/uneven_heads
74870db3
molly-smith Merge branch 'master' into gma/uneven_heads
115cc208
molly-smith enabled auto-merge 2 years ago
mrwyattii requested changes on 2023-09-13
delock Refine according to comments
0781c41e
disabled auto-merge 2 years ago
Head branch was pushed to by a user without write access
mrwyattii commented on 2023-09-14
mrwyattii remove old comment
194337fe
delock Merge branch 'master' into gma/uneven_heads
47d84caf
mrwyattii Merge branch 'master' into gma/uneven_heads
369eb3e1
delock fix bug in getting num_kv_heads
567fb9ae
molly-smith Merge branch 'master' into gma/uneven_heads
47c83cac
molly-smith assigned mrwyattii 2 years ago
tjruwase Merge branch 'master' into gma/uneven_heads
d194ab07
mrwyattii approved these changes on 2023-10-02
mrwyattii enabled auto-merge 2 years ago
delock Merge branch 'master' into gma/uneven_heads
6db5ddd0
disabled auto-merge 2 years ago
Head branch was pushed to by a user without write access
delock Merge branch 'up-master' into gma/uneven_heads
698b62a8
delock support uneven sharding of lm_head tensor parallel
d75149fb
delock Merge branch 'master' into gma/uneven_heads
248532db
delock Merge branch 'master' into gma/uneven_heads
a9056fd5
delock Merge branch 'master' into gma/uneven_heads
81bd29f2
delock Merge branch 'master' into gma/uneven_heads
693a9fea
delock Merge branch 'master' into gma/uneven_heads
4c45a5b7
delock Merge branch 'master' into gma/uneven_heads
a7513e17
tjruwase enabled auto-merge 2 years ago
tjruwase merged f15cccfa into master 2 years ago
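
The commits in this PR revolve around one arithmetic problem: when the number of attention heads is not a multiple of the number of tensor-parallel ranks, some ranks must hold more heads than others. A minimal sketch of that idea follows. The function names `split_heads` and `shard_sizes` are hypothetical and are not DeepSpeed's API (per the commit messages, the PR's own helpers live in `tp_shard.py` under `module_inject`). Giving the extra heads to the lowest-numbered ranks is likewise an assumption made for illustration, and the sketch simply rejects sizes that are not a multiple of the head count rather than handling them.

```python
from __future__ import annotations


def split_heads(num_heads: int, world_size: int) -> list[int]:
    """Return how many attention heads each rank holds.

    Assumption for this sketch: the first (num_heads % world_size) ranks
    each take one extra head, so per-rank counts differ by at most one.
    """
    if num_heads <= 0 or world_size <= 0:
        raise ValueError("num_heads and world_size must be positive")
    base, extra = divmod(num_heads, world_size)
    return [base + (1 if rank < extra else 0) for rank in range(world_size)]


def shard_sizes(total_size: int, num_heads: int, world_size: int) -> list[int]:
    """Return the slice width each rank takes from a dimension of total_size.

    The dimension is cut on head boundaries, so every rank receives a
    whole number of heads.
    """
    if total_size % num_heads != 0:
        # Simplification: this sketch does not handle the non-divisible case.
        raise ValueError("total_size must be a multiple of num_heads")
    per_head = total_size // num_heads
    return [heads * per_head for heads in split_heads(num_heads, world_size)]


if __name__ == "__main__":
    # 32 heads of width 128 (hidden size 4096) spread over 3 ranks.
    print(split_heads(32, 3))
    print(shard_sizes(4096, 32, 3))
```

With 32 heads of width 128 on 3 ranks, the head counts are 11, 11, and 10, and the shard widths are 1408, 1408, and 1280, which sum back to the full 4096.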