llvm-project
ac47d8c2 - [msan] Handle Arm NEON BFloat16 multiply-add to single-precision (#178510)

Commit

88 days ago

[msan] Handle Arm NEON BFloat16 multiply-add to single-precision (#178510)

aarch64.neon.bfmlalb/t perform dot-products after zeroing out the odd/even-indexed values. We handle these by generalizing handleVectorDotProductIntrinsic() and (mis-)using getPclmulMask().
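To illustrate the "zero out, then dot-product" view described in the commit message, here is a minimal standalone sketch of how shadow (uninitialized-ness) could flow through such an operation. It is a simplified model and not the code in MemorySanitizer.cpp: the names `maskUnusedLanes` and `propagateShadow` are hypothetical, the conservative rule "any poisoned input lane that is actually read poisons the whole output lane" is an assumption, and the real handlers named in the commit may use different rules.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Shadow convention (as in MSan): a non-zero shadow means "uninitialized".
using Bf16x8Shadow = std::array<uint16_t, 8>; // shadow of <8 x bfloat>
using F32x4Shadow = std::array<uint32_t, 4>;  // shadow of <4 x float>

// Hypothetical helper: clear the shadow of lanes the instruction ignores.
// keepOdd == false models bfmlalb (even-indexed lanes are used);
// keepOdd == true models bfmlalt (odd-indexed lanes are used).
Bf16x8Shadow maskUnusedLanes(const Bf16x8Shadow &s, bool keepOdd) {
  Bf16x8Shadow out{};
  for (std::size_t i = 0; i < s.size(); ++i)
    if ((i % 2 == 1) == keepOdd)
      out[i] = s[i];
  return out;
}

// Hypothetical, conservative propagation: after masking, treat each pair of
// adjacent bf16 lanes as a 2-element dot product accumulated into one f32
// lane. The output lane is fully poisoned if the accumulator lane or any
// surviving input lane in the pair is poisoned.
F32x4Shadow propagateShadow(const F32x4Shadow &acc, const Bf16x8Shadow &a,
                            const Bf16x8Shadow &b, bool keepOdd) {
  const Bf16x8Shadow ma = maskUnusedLanes(a, keepOdd);
  const Bf16x8Shadow mb = maskUnusedLanes(b, keepOdd);
  F32x4Shadow out{};
  for (std::size_t i = 0; i < out.size(); ++i) {
    bool poisoned = acc[i] != 0;
    for (std::size_t k = 0; k < 2; ++k)
      poisoned = poisoned || ma[2 * i + k] != 0 || mb[2 * i + k] != 0;
    out[i] = poisoned ? 0xFFFFFFFFu : 0u;
  }
  return out;
}

// Small usage example: poison only lane 1 (an odd lane) of operand a.
void demo() {
  F32x4Shadow acc{};
  Bf16x8Shadow a{}, b{};
  a[1] = 0xFFFF;
  // The "bottom" variant never reads lane 1, so the result stays clean.
  assert(propagateShadow(acc, a, b, /*keepOdd=*/false)[0] == 0u);
  // The "top" variant reads lane 1, so output lane 0 becomes poisoned.
  assert(propagateShadow(acc, a, b, /*keepOdd=*/true)[0] == 0xFFFFFFFFu);
}
```

Masking before the dot product is what lets a single generalized handler serve both variants: only the mask differs between the "b" and "t" forms.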

References

#178510 - [msan] Handle Arm NEON BFloat16 multiply-add to single-precision

Author

thurstond

Parents

10792738
