llvm-project
e86a4294 - [NVPTX] Add missing mbarrier intrinsics (#164864)

Commit
180 days ago
[NVPTX] Add missing mbarrier intrinsics (#164864) This patch adds a few more mbarrier intrinsics, completing support for all the mbarrier variants up to Blackwell architecture. * Docs are updated in NVPTXUsage.rst. * lit tests are added for all the variants. * lit tests are verified with PTXAS from CUDA-12.8 toolkit. Signed-off-by: Durgadoss R <durgadossr@nvidia.com>
Author
Parents
Loading