Fix embedding renormalization on CPU (#28546)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28546
Fixes the repro from #28370.
Test Plan: Imported from OSS
Differential Revision: D18251533
Pulled By: albanD
fbshipit-source-id: cd9ab609797b8c887ec9128752cc6a2f58a9aee6