Simplify Tensor Parallel implementation with PyTorch TP #34184
Simplify Tensor Parallel implementation with PyTorch TP
e60fb879
Move tp_plan to config
fd7f7c72
Merge remote-tracking branch 'origin/main' into tp_llama
9224cabd
Lint
79cc524e
Format and warning
a2934b33
kwen2501
force pushed
to
a2934b33
1 year ago
Disable copy-from check
a8fc418c
Conditionally get attr from config
e84a3889
make fix-copies
396d158c
Move base_model_tp_plan to PretrainedConfig
7b346b55
Merge remote-tracking branch 'origin/main' into tp_llama
d60679b5
Merge remote-tracking branch 'origin/main' into tp_llama
dda058af
Move TP into from_pretrained
12fbbe70
Add device context for load
02c8c393
Do not serialize
073c521d
Move _tp_plan setting to post_init
db6e5eeb
kwen2501
force pushed
to
db6e5eeb
1 year ago
Add has_tp_plan
5bb294ec
Add test_tp
290a7f18
Add 'Multi-gpu inference' doc
bd2e89c1
Merge remote-tracking branch 'origin/main' into tp_llama
4892ceff
Add backward support for device type identification
9648f316
Auto-detect accelerator
93ba2835
supports_tp_plan
73524c90
copyright year
f312e551
Merge remote-tracking branch 'origin/main' into tp_llama
ca93bdb9
kwen2501
force pushed
to
ca93bdb9
1 year ago
Merge branch 'main' into tp_llama
dc2672f8
Fix copy
1e27d6f9
kwen2501
force pushed
to
1e27d6f9
1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub