transformers
Simplify Tensor Parallel implementation with PyTorch TP
#34184
Merged

Simplify Tensor Parallel implementation with PyTorch TP #34184

ArthurZucker merged 26 commits into huggingface:main from kwen2501:tp_llama
kwen2501
kwen2501 Simplify Tensor Parallel implementation with PyTorch TP
e60fb879
kwen2501
ArthurZucker
ArthurZucker commented on 2024-10-21
ArthurZucker
ArthurZucker
ArthurZucker
ArthurZucker
kwen2501 Move tp_plan to config
fd7f7c72
kwen2501
kwen2501
ArthurZucker
ArthurZucker commented on 2024-10-24
kmehant
kmehant commented on 2024-10-25
kmehant
kmehant commented on 2024-10-25
kwen2501
ArthurZucker
kwen2501 Merge remote-tracking branch 'origin/main' into tp_llama
9224cabd
kwen2501 Lint
79cc524e
kwen2501 Format and warning
a2934b33
kwen2501 kwen2501 force pushed to a2934b33 1 year ago
kwen2501
kwen2501 Disable copy-from check
a8fc418c
kwen2501
HuggingFaceDocBuilderDev
ArthurZucker
ArthurZucker commented on 2024-10-30
kwen2501
ArthurZucker
kwen2501 Conditionally get attr from config
e84a3889
kwen2501 make fix-copies
396d158c
kwen2501 Move base_model_tp_plan to PretrainedConfig
7b346b55
kwen2501 Merge remote-tracking branch 'origin/main' into tp_llama
d60679b5
kwen2501
kwen2501 Merge remote-tracking branch 'origin/main' into tp_llama
dda058af
kmehant
kmehant commented on 2024-11-04
ArthurZucker
ArthurZucker commented on 2024-11-04
ArthurZucker
kwen2501
kwen2501
kwen2501 Move TP into from_pretrained
12fbbe70
kwen2501 Add device context for load
02c8c393
kwen2501 Do not serialize
073c521d
kwen2501 kwen2501 force pushed 1 year ago
kwen2501 kwen2501 force pushed 1 year ago
kwen2501 Move _tp_plan setting to post_init
db6e5eeb
kwen2501 kwen2501 force pushed to db6e5eeb 1 year ago
kwen2501
kwen2501 kwen2501 requested a review from ArthurZucker ArthurZucker 1 year ago
kwen2501
kwen2501
ArthurZucker
ArthurZucker
ArthurZucker approved these changes on 2024-11-14
kmehant
kwen2501
kwen2501
kmehant
ArthurZucker
kwen2501 Add has_tp_plan
5bb294ec
kwen2501 Add test_tp
290a7f18
kwen2501 Add 'Multi-gpu inference' doc
bd2e89c1
kwen2501 Merge remote-tracking branch 'origin/main' into tp_llama
4892ceff
kwen2501 Add backward support for device type identification
9648f316
kwen2501
ArthurZucker
ArthurZucker
ArthurZucker approved these changes on 2024-11-15
kwen2501 Auto-detect accelerator
93ba2835
kwen2501 supports_tp_plan
73524c90
kwen2501 copyright year
f312e551
kwen2501
kwen2501 commented on 2024-11-16
kwen2501 Merge remote-tracking branch 'origin/main' into tp_llama
ca93bdb9
kwen2501 kwen2501 force pushed to ca93bdb9 1 year ago
kwen2501 Merge branch 'main' into tp_llama
dc2672f8
kwen2501 Fix copy
1e27d6f9
kwen2501 kwen2501 force pushed to 1e27d6f9 1 year ago
kwen2501
ArthurZucker
ArthurZucker ArthurZucker merged 20142ab5 into main 1 year ago
ArthurZucker
ArthurZucker
loadams
kmehant
kwen2501
ArthurZucker

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone