[inductor] Support multiple symbolic numel expr in CudaWrapperCodeGen (#102093)
Summary: Add a set to avoid generating extra `auto` when seeing the
symbolic numel expression for the second time.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/102093
Approved by: https://github.com/jansel