pytorch
9a992b09 - [4/4] Intel GPU Runtime Upstreaming for Device (#116869)

Commit View On GitHub

Commit

228 days ago

[4/4] Intel GPU Runtime Upstreaming for Device (#116869) # Motivation According to [[1/4] Intel GPU Runtime Upstreaming for Device](https://github.com/pytorch/pytorch/pull/116019), as mentioned in [[RFC] Intel GPU Runtime Upstreaming](https://github.com/pytorch/pytorch/issues/114842), this last PR covers the changes under lazy initialization. # Design This PR primarily offers the support of multi-processing via lazy initialization. We lazily initialize our runtime avoiding initializing XPU until the first time it is accessed. In our design, we extend `cuda_lazy_init` to `device_lazy_init` which is a device-agnostic API that can support any backend. And change `maybe_initialize_cuda` to `maybe_initialize_device` to support lazy initialization for both CUDA and XPU while maintaining scalability. # Additional Context We adopt a similar design to CUDA. So we share some code with CUDA. Pull Request resolved: https://github.com/pytorch/pytorch/pull/116869 Approved by: https://github.com/EikanWang, https://github.com/jgong5, https://github.com/gujinghui, https://github.com/malfet ghstack dependencies: #119248

Author

guangyey

Committer

pytorchmergebot

Parents

3cb7ec31

pytorch 9a992b09 - [4/4] Intel GPU Runtime Upstreaming for Device (#116869)

Commit

pytorch
9a992b09 - [4/4] Intel GPU Runtime Upstreaming for Device (#116869)