[quant][core][feature] Implemented index_put for QuantizedCPU tensors
Summary:
Previously, there was no support for index_put for quantized tensors.
This PR introduces the feature for quantized CPU tensors, but is limited
to per tensor quantized tensors and accumulate=False. A
corresponding test case was added to test the impl.
Most of the implementation is copied over from the fp kernel and adapted
for quantized tensors.
TODO: implement index_put for QuantizedCUDA tensors.
Test Plan:
```
python test/test_quantization.py -k test_qtensor_index_put
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78384
Approved by: https://github.com/vkuzo