SemanticDiff

pytorch
77f32eb6 - [inductor] Enable CudaWrapperCodeGen for non-AOT mode (#98264)

Commit

1 year ago

[inductor] Enable CudaWrapperCodeGen for non-AOT mode (#98264)

Summary: When _inductor.config.cpp_wrapper is specified, we run a two-pass wrapper codegen to generate C++ wrapper code that calls cuLaunchKernel to launch pre-compiled CUDA kernels, and then call load_inline to load the generated wrapper back into the Python world.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/98264
Approved by: https://github.com/ngimel
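
As a rough illustration of the option named in the summary, the sketch below turns on `torch._inductor.config.cpp_wrapper` before compiling a function with `torch.compile`. The helper names `compile_with_cpp_wrapper`, `add_and_scale`, and `demo` are hypothetical and do not come from the commit. Whether the CUDA wrapper path is actually taken depends on the installed PyTorch version, a working C++ toolchain, and an available GPU, so treat this as a minimal sketch under those assumptions, not as the commit's own test.

```python
def compile_with_cpp_wrapper(fn):
    """Return `fn` compiled by Inductor with the C++ wrapper option enabled."""
    # Imported lazily so this module can be loaded on machines without PyTorch.
    import torch
    import torch._inductor.config as inductor_config

    # The flag named in the commit summary; it asks Inductor to emit the
    # wrapper code in C++ instead of Python.
    inductor_config.cpp_wrapper = True
    return torch.compile(fn)


def add_and_scale(x, y):
    # Hypothetical example workload, not taken from the commit.
    return (x + y) * 2.0


def demo():
    """Compile the workload and compare it against eager execution."""
    import torch

    # Assumption: fall back to CPU when no GPU is present; only the CUDA
    # case exercises the code path this commit describes.
    device = "cuda" if torch.cuda.is_available() else "cpu"
    a = torch.randn(1024, device=device)
    b = torch.randn(1024, device=device)

    compiled = compile_with_cpp_wrapper(add_and_scale)
    torch.testing.assert_close(compiled(a, b), add_and_scale(a, b))
```

Calling `demo()` on a machine with PyTorch installed triggers compilation on the first call to the compiled function; the comparison against the eager result is a simple sanity check that the generated wrapper computes the same values.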

Author

desertfire

Committer

pytorchmergebot

Parents
