Benchmark for torch.ops.quantized.linear_prepack_fp16 operator (#52229)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52229
Create benchmarks for
torch.ops.quantized.linear_prepack_fp16 and torch.ops.quantized.linear_unpack_fp16 operators
Benchmark for these operators are written in the same format as the other benchmarks for other operators.
Test Plan:
linear_prepack_fp16 test was successfully run with various parameters:
Sample test run output:
----------------------------------------
PyTorch/Caffe2 Operator Micro-benchmarks
----------------------------------------
Tag : long
Benchmarking PyTorch: linear_prepack_fp16
Mode: Eager
Name: linear_prepack_fp16_M8_N32_K256_cpu
Input: M: 8, N: 32, K: 256, device: cpu
Forward Execution Time (us) : 14.002
Benchmarking PyTorch: linear_prepack_fp16
Mode: Eager
Name: linear_prepack_fp16_M8_N32_K512_cpu
Input: M: 8, N: 32, K: 512, device: cpu
Forward Execution Time (us) : 14.114
Benchmarking PyTorch: linear_prepack_fp16
Mode: Eager
Name: linear_prepack_fp16_M8_N64_K256_cpu
Input: M: 8, N: 64, K: 256, device: cpu
Forward Execution Time (us) : 19.355
Benchmarking PyTorch: linear_prepack_fp16
Mode: Eager
Name: linear_prepack_fp16_M8_N64_K512_cpu
Input: M: 8, N: 64, K: 512, device: cpu
Forward Execution Time (us) : 19.056
Benchmarking PyTorch: linear_prepack_fp16
Mode: Eager
Name: linear_prepack_fp16_M128_N32_K256_cpu
Input: M: 128, N: 32, K: 256, device: cpu
Forward Execution Time (us) : 115.963
Benchmarking PyTorch: linear_prepack_fp16
Mode: Eager
Name: linear_prepack_fp16_M128_N32_K512_cpu
Input: M: 128, N: 32, K: 512, device: cpu
Forward Execution Time (us) : 116.259
Benchmarking PyTorch: linear_prepack_fp16
Mode: Eager
Name: linear_prepack_fp16_M128_N64_K256_cpu
Input: M: 128, N: 64, K: 256, device: cpu
Forward Execution Time (us) : 229.336
Benchmarking PyTorch: linear_prepack_fp16
Mode: Eager
Name: linear_prepack_fp16_M128_N64_K512_cpu
Input: M: 128, N: 64, K: 512, device: cpu
Forward Execution Time (us) : 220.016
linear_unpack_fp16 test was successfully run with identical parameters
Reviewed By: b-koopman
Differential Revision: D26403343
fbshipit-source-id: 11a98e56177952b94f291006975b0b719f48d1b9