llvm-project
7803b4a6 - AMDGPU: Add support for llvm.trap to handling of intrinsics with !nocallback (#175230)

Commit
9 days ago
AMDGPU: Add support for llvm.trap to handling of intrinsics with !nocallback (#175230) This adds support to whitelist trap intrinsics while handling of intrinsics with !nocallback. This fixes the reasons behind the previous revert of #131759. The attributor was exiting early whenever it saw intrinsics without the nocallback bit, so trap-only kernels lost all the inferred “no implicit arg” metadata and their amdgpu-agpr-alloc=0 guarantees. That conservative fallback broke certain workloads by forcing unnecessary implicit arguments and AGPR reservations. This patch allows the pass to recognize leaf-like trap intrinsics, so they no longer poison the analysis. --------- Co-authored-by: Matt Arsenault <arsenm2@gmail.com>
Author
Parents
Loading