Enable SLS FP32 accumulation SparseLengthsWeightedSumFused8BitRowwiseFakeFP32NNPI Op. (#41577)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/41577
* Remove skipping test
* Use fma_avx_emulation
* Increase test examples to 100
(Note: this ignores all push blocking failures!)
Test Plan: Tests are covered in test_sls_8bit_nnpi.py
Reviewed By: hyuen
Differential Revision: D22585742
fbshipit-source-id: e1f62f47eb10b402b11893ffca7a6786e31daa79