onnxruntime
--shm-size=1024m to fix nccl shared memory issue
#5214
Merged

--shm-size=1024m to fix nccl shared memory issue #5214

liqunfu merged 2 commits into master from liqun/nccl_shm
liqunfu
--shm-size=1024m to fix nccl shared memory issue
50d7be58
liqunfu liqunfu requested a review 5 years ago
liqunfu liqunfu requested a review from edgchen1 edgchen1 5 years ago
liqunfu liqunfu requested a review from snnn snnn 5 years ago
liqunfu liqunfu requested a review from ytaous ytaous 5 years ago
liqunfu liqunfu requested a review from thiagocrepaldi thiagocrepaldi 5 years ago
liqunfu liqunfu requested a review from spandantiwari spandantiwari 5 years ago
--shm-size=256m
c650187e
ytaous
ytaous approved these changes on 2020-09-17
liqunfu liqunfu merged f37e1292 into master 5 years ago
liqunfu liqunfu deleted the liqun/nccl_shm branch 5 years ago
edgchen1 edgchen1 added release:orttraining_rc3
edgchen1 edgchen1 added triage:approved
faxu faxu added release:1.5.0
faxu faxu removed release:1.5.0

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone