Cuda quantized tensors, support for quantize per tensor (#59700)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59700
implements quantized tensors in cuda for for per_tensor
quantization, along with several necessary functions
(Note: this ignores all push blocking failures!)
Test Plan:
python test/test_quantization.py TestQuantizedTensors
python test/test_quantization.py
TestQuantizedTensors.test_compare_quant_dequant_device_numerics
python test/test_quantization.py
TestQuantizedTensors.test_qtensor_to_device
Imported from OSS
Reviewed By: jerryzh168
Differential Revision: D29018272
fbshipit-source-id: e07d19d6d67729c46324c2bb5946d959e6e6db8e