llvm-project
d72bc090 - AMDGPU: Use fpmath metadata on f16 log/log10 intrinsics (#180489)

Commit
78 days ago
AMDGPU: Use fpmath metadata on f16 log/log10 intrinsics (#180489) result by default, and the old expansion with the afn flag. The old result was good enough for OpenCL conformance, so consider the fpmath metadata and use the fast path. This is done in AMDGPUCodeGenPrepare for the same reason that sqrt is handled here, which is the DAG does not have a way to access fpmath metadata from the original instruction. This is not yet of practical use, because the log calls sourced from OpenCL are not actually marked with this metadata and there isn't a method to produce it from the source languages.
Author
Parents
Loading