llvm-project
4c1b1f6d - [NVPTX] Add support for clamped funnel shift intrinsics (#113228)

Commit
344 days ago
[NVPTX] Add support for clamped funnel shift intrinsics (#113228)

Add support for the ``llvm.nvvm.fshl.clamp`` and ``llvm.nvvm.fshr.clamp`` intrinsics. These intrinsics are similar to the generic LLVM funnel shift intrinsics, except that the shift amount is clamped to the integer width rather than taken modulo it. Currently only ``i32`` is supported; it is implemented with the ``shf.[rl].clamp.b32`` PTX instruction.
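To illustrate the difference between clamping and the generic modulo behavior, here is a minimal C++ reference model of the ``i32`` case. It is a sketch, not the code from this commit: the function names (``fshl_clamp``, ``fshr_clamp``, ``fshl_generic``) and parameter names (``hi``, ``lo``, ``amt``) are hypothetical, and the operand order is assumed to match the generic ``llvm.fshl``/``llvm.fshr`` intrinsics, where the first operand supplies the most significant bits.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical reference model (not taken from the commit).
// Assumption: operands are ordered as in the generic llvm.fshl/llvm.fshr,
// i.e. `hi` holds the most significant 32 bits of the concatenated value.

// Clamped funnel shift left: shift amount is min(amt, 32).
uint32_t fshl_clamp(uint32_t hi, uint32_t lo, uint32_t amt) {
  uint32_t n = std::min<uint32_t>(amt, 32);
  uint64_t wide = (static_cast<uint64_t>(hi) << 32) | lo;
  // Shift the 64-bit concatenation left and keep the upper 32 bits.
  return static_cast<uint32_t>((wide << n) >> 32);
}

// Clamped funnel shift right: shift amount is min(amt, 32).
uint32_t fshr_clamp(uint32_t hi, uint32_t lo, uint32_t amt) {
  uint32_t n = std::min<uint32_t>(amt, 32);
  uint64_t wide = (static_cast<uint64_t>(hi) << 32) | lo;
  // Shift the 64-bit concatenation right and keep the lower 32 bits.
  return static_cast<uint32_t>(wide >> n);
}

// Generic funnel shift left, for comparison: shift amount is amt modulo 32.
uint32_t fshl_generic(uint32_t hi, uint32_t lo, uint32_t amt) {
  uint32_t n = amt % 32;
  uint64_t wide = (static_cast<uint64_t>(hi) << 32) | lo;
  return static_cast<uint32_t>((wide << n) >> 32);
}
```

Under this model the two variants agree for shift amounts below 32 and diverge from 32 upward: with a shift amount of 40, ``fshl_generic`` shifts by 8, while ``fshl_clamp`` shifts by 32 and returns ``lo`` unchanged.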