[quant][core][gpu][improvement] Made plan and run for quantized linear op conform with Conv_v8.cpp
Summary:
See the summary of https://github.com/pytorch/pytorch/pull/76788. The same
idea applies here, but for the linear operator.
Test Plan:
```
python test/test_quantization.py -k test_qlinear_cudnn
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77519
Approved by: https://github.com/jerryzh168