pytorch
edf22f3a - Modify signature of dequantize ops for decomposed quantized Tensor (#119173) (#121450)

Commit View On GitHub

Commit

191 days ago

Modify signature of dequantize ops for decomposed quantized Tensor (#119173) (#121450) Summary: X-link: https://github.com/pytorch/executorch/pull/2308 Note: The initial purpose of this PR is to draw suggestion and feedback regarding better alternative, if any. At present, dequantize op for decomposed quantized Tensor representation e.g. dequantize_per_tensor() assumes the output dtype as torch.float and hence, it does not have the output dtype in its operator argument list. However, this op signature becomes unusable when the assumption breaks. Because, in case the output dtype is different from torch.float, there is no way to specify the same during dequantization. This change is aimed at generalizing the signature of dequantize op like dequantize_per_tensor() for wider use-cases where the output dtype can be different from torch.float and needs to passed during dequantization. The proposal is to use an additional argument named 'output_dtype' to solve the problem. However, we would also like to have suggestion and feedback regarding any better alternative that can be used instead. cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 Xia-Weiwen leslie-fang-intel Reviewed By: digantdesai Differential Revision: D53590486 Pulled By: manuelcandales Co-authored-by: kausik <kmaiti@habana.ai> Pull Request resolved: https://github.com/pytorch/pytorch/pull/121450 Approved by: https://github.com/jerryzh168

Author

kausikmaiti

Committer

pytorchmergebot

Parents

06d23920

pytorch edf22f3a - Modify signature of dequantize ops for decomposed quantized Tensor (#119173) (#121450)

Commit

pytorch
edf22f3a - Modify signature of dequantize ops for decomposed quantized Tensor (#119173) (#121450)