llvm-project
46ef57a7 - [MLIR][NVVM] Add mbarrier.try_wait Op (#170285)

Commit
146 days ago
[MLIR][NVVM] Add mbarrier.try_wait Op (#170285) This patch adds an Op for mbarrier.try_wait operation which lowers to the corresponding intrinsics. This Op has support for an optional time-limit, state-or-phase as well as relaxed memory semantics, completing the features on this Op up to Blackwell. Unlike the existing `nvvm.mbarrier.try_wait.parity` Op, this Op does not provide a _blocking_ implementation. We intend to add looping around this at NVGPU in a subsequent PR (and deprecate the inline-asm based Op here). lit tests are added to verify the lowering to the intrinsics. Signed-off-by: Durgadoss R <durgadossr@nvidia.com>
Author
Parents
Loading