mul under strict-fp. (#169156)

Commit

159 days ago

[ARM] Introduce intrinsics for MVE add/sub/mul under strict-fp. (#169156) As far as I understand, the MVE fp vadd/vsub/vmul instructions will set exception flags in the same ways as scalar fadd/fsub/fmul, but will not honor flush-to-zero (for f32 they always flush, for f16 they follows the fpsrc flags) and will always use the default rounding mode. This means that we cannot convert the vadd_f23/vsub_f32/vmul_f32 intrinsics to llvm.constrained.fadd/fsub/fmul and then vadd/vsub/vmul without changing the expected behaviour under strict-fp. This patch introduces a set in intrinsics that we can use instead, going from vadd_f32 -> llvm.arm.mve.vadd -> MVE_VADD. The current implementations assumes that the standard variant of a strictfp alternative will be a IRBuilder, this can be changed to take a IRBuilder or IRInt.

References

#169156 - [ARM] Introduce intrinsics for MVE add/sub/mul under strict-fp.

Author

davemgreen

Parents

44a7d2f2

llvm-project 1d64fd5d - [ARM] Introduce intrinsics for MVE add/sub/mul under strict-fp. (#169156)

llvm-project
1d64fd5d - [ARM] Introduce intrinsics for MVE add/sub/mul under strict-fp. (#169156)