llvm-project
903cfcf1 - [NVVM] Update properties for non-sync variants of the SHFL intrinsics (#189615)

Commit
11 days ago
Non-sync SHFL variants (shfl without .sync) are pure functions of their SSA operands and the active thread mask. Assign IntrReadMem, IntrInaccessibleMemOnly and IntrWillReturn so that:

- Reading the implicit mask state is modeled for correct ordering with other convergent operations
- Truly dead non-sync shfl code can still be DCE'd

Sync SHFL variants keep IntrInaccessibleMemOnly (no IntrReadMem, no IntrWillReturn) to model synchronization side effects and prevent unsafe DCE/reordering.
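The difference in DCE behavior between the two variants can be illustrated with a small toy model. This is a sketch, not LLVM code: `IntrinsicProps`, `may_have_side_effects` and `removable_if_unused` are hypothetical names invented for illustration. The model assumes the usual rule that an unused call is removable only if it cannot write memory and is guaranteed to return, and it assumes the intrinsics never unwind.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IntrinsicProps:
    """Toy stand-in for a subset of intrinsic properties (not an LLVM API)."""
    read_only: bool              # models IntrReadMem
    inaccessible_mem_only: bool  # models IntrInaccessibleMemOnly
    will_return: bool            # models IntrWillReturn


def may_have_side_effects(p: IntrinsicProps) -> bool:
    # Without the read-only property, the call may write memory
    # (here: the inaccessible state that models the thread mask / sync).
    may_write = not p.read_only
    # A call that may not return is also treated as having side effects.
    return may_write or not p.will_return


def removable_if_unused(p: IntrinsicProps) -> bool:
    # A call whose result is unused can be deleted only if it has no side effects.
    return not may_have_side_effects(p)


# Property sets as described in the commit message.
NON_SYNC_SHFL = IntrinsicProps(read_only=True, inaccessible_mem_only=True, will_return=True)
SYNC_SHFL = IntrinsicProps(read_only=False, inaccessible_mem_only=True, will_return=False)

if __name__ == "__main__":
    print("non-sync removable when unused:", removable_if_unused(NON_SYNC_SHFL))
    print("sync removable when unused:", removable_if_unused(SYNC_SHFL))
```

In this model the non-sync variant stays removable when its result is unused, because it only reads inaccessible state and always returns. The sync variant does not: without IntrReadMem it is assumed to possibly write that state, which keeps it in place. `inaccessible_mem_only` does not enter the deadness check. It is carried only to mirror the property lists in the commit message, where it restricts which memory the intrinsic may touch.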