[inductor] Add aten.multinomial to disallowed cudagraphs ops (#108105)
Fixes:
```python
CUDA_LAUNCH_BLOCKING=1 ./benchmarks/dynamo/torchbench.py --inference --performance --no-skip --inductor --freezing --only nanogpt_generate
loading model: 0it [00:00, ?it/s]number of parameters: 123.69M
loading model: 0it [00:07, ?it/s]
cuda eval nanogpt_generate
ERROR:common:Backend dynamo failed in warmup()
Traceback (most recent call last):
File "/data/users/jansel/pytorch/torch/_inductor/cudagraph_trees.py", line 1084, in _record
static_outputs = model(inputs)
File "/data/users/jansel/pytorch/torch/_inductor/codecache.py", line 401, in _run_from_cache
return compiled_graph.compiled_artifact(inputs)
File "/tmp/torchinductor_jansel/db/cdbk4ip3fucyoccnbnoik2crjpdkliwxll653l7l3wwsxiygmade.py", line 18375, in call
buf239 = aten.multinomial.default(buf238, 1)
File "/data/users/jansel/pytorch/torch/_ops.py", line 448, in __call__
return self._op(*args, **kwargs or {})
RuntimeError: CUDA error: operation not permitted when stream is capturing
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108105
Approved by: https://github.com/eellison
ghstack dependencies: #108096, #108087, #108098