Use cub 1.15's latest scan-by-key algorithm to replace thrust for Embedding.cu and EmbeddingBag.cu (#66580)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66580
Reviewed By: mruberry
Differential Revision: D34116388
Pulled By: ngimel
fbshipit-source-id: 2e8936ca7c10f96a8e7a5696248f56bf87290d6e
(cherry picked from commit 51cff8cb1de725bca52d5137b01b16d054b95f63)