pytorch
4501b52f - Benchmark for torch.ops.quantized.linear_prepack_fp16 operator (#52229)

Commit
3 years ago
Benchmark for torch.ops.quantized.linear_prepack_fp16 operator (#52229) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/52229 Create benchmarks for torch.ops.quantized.linear_prepack_fp16 and torch.ops.quantized.linear_unpack_fp16 operators Benchmark for these operators are written in the same format as the other benchmarks for other operators. Test Plan: linear_prepack_fp16 test was successfully run with various parameters: Sample test run output: ---------------------------------------- PyTorch/Caffe2 Operator Micro-benchmarks ---------------------------------------- Tag : long Benchmarking PyTorch: linear_prepack_fp16 Mode: Eager Name: linear_prepack_fp16_M8_N32_K256_cpu Input: M: 8, N: 32, K: 256, device: cpu Forward Execution Time (us) : 14.002 Benchmarking PyTorch: linear_prepack_fp16 Mode: Eager Name: linear_prepack_fp16_M8_N32_K512_cpu Input: M: 8, N: 32, K: 512, device: cpu Forward Execution Time (us) : 14.114 Benchmarking PyTorch: linear_prepack_fp16 Mode: Eager Name: linear_prepack_fp16_M8_N64_K256_cpu Input: M: 8, N: 64, K: 256, device: cpu Forward Execution Time (us) : 19.355 Benchmarking PyTorch: linear_prepack_fp16 Mode: Eager Name: linear_prepack_fp16_M8_N64_K512_cpu Input: M: 8, N: 64, K: 512, device: cpu Forward Execution Time (us) : 19.056 Benchmarking PyTorch: linear_prepack_fp16 Mode: Eager Name: linear_prepack_fp16_M128_N32_K256_cpu Input: M: 128, N: 32, K: 256, device: cpu Forward Execution Time (us) : 115.963 Benchmarking PyTorch: linear_prepack_fp16 Mode: Eager Name: linear_prepack_fp16_M128_N32_K512_cpu Input: M: 128, N: 32, K: 512, device: cpu Forward Execution Time (us) : 116.259 Benchmarking PyTorch: linear_prepack_fp16 Mode: Eager Name: linear_prepack_fp16_M128_N64_K256_cpu Input: M: 128, N: 64, K: 256, device: cpu Forward Execution Time (us) : 229.336 Benchmarking PyTorch: linear_prepack_fp16 Mode: Eager Name: linear_prepack_fp16_M128_N64_K512_cpu Input: M: 128, N: 64, K: 512, device: cpu Forward Execution Time (us) : 220.016 linear_unpack_fp16 test was successfully run with identical parameters Reviewed By: b-koopman Differential Revision: D26403343 fbshipit-source-id: 11a98e56177952b94f291006975b0b719f48d1b9
Author
Parents
Loading