[torch] Add fmsub to vectrozation primitives (#86568)
Summary: Add fmsub which is similar to fmadd
Test Plan: CI
Differential Revision: D40215267
Pull Request resolved: https://github.com/pytorch/pytorch/pull/86568
Approved by: https://github.com/ajtulloch, https://github.com/malfet