llvm-project
d18a784d - [compiler-rt] Define GPU specific handling of profiling functions (#185763)

Commit
21 days ago
[compiler-rt] Define GPU specific handling of profiling functions (#185763) Summary: The changes in https://www.github.com/llvm/llvm-project/pull/185552 allowed us to start building the standard `libclang_rt.profile.a` for GPU targets. This PR expands this by adding an optimized GPU routine for counter increment and removing the special-case handling of these functions in the OpenMP runtime. Vast majority of these functions are boilerplate, but we should be able to do more interesting things with this in the future, like value or memory profiling.
Author
Parents
Loading