pytorch
26a91a9f - [WIP][JIT] Add benchmarking support of NV Fuser with FP16 dtype support (#44101)

Commit

4 years ago

[WIP][JIT] Add benchmarking support of NV Fuser with FP16 dtype support (#44101) Summary: Modified files in `benchmarks/tensorexpr` to add support for NVIDIA's Fuser for the jit compiler. This support has some modifications besides adding an option to support the NVIDIA fuser: * Adds FP16 Datatype support * Fixes SOL/Algo calculations to generally use the data type instead of being fixed to 4 bytes * Adds IR printing and kernel printing knobs * Adds a knob `input_iter` to create ranges of inputs currently only for reductions * Adds further reduction support for Inner and Outer dimension reductions that are compatible with the `input_iter` knob. * Added `simple_element`, `reduce2d_inner`, and `reduce2d_outer` to isolate performance on elementwise and reduction operations in the most minimal fashion. Pull Request resolved: https://github.com/pytorch/pytorch/pull/44101 Reviewed By: ngimel Differential Revision: D23713658 Pulled By: bertmaher fbshipit-source-id: d6b83cfab559aefe107c23b3c0f2df9923b3adc1

Author

kevinstephano

Committer

facebook-github-bot

Parents

2f4c31ce

pytorch 26a91a9f - [WIP][JIT] Add benchmarking support of NV Fuser with FP16 dtype support (#44101)

pytorch
26a91a9f - [WIP][JIT] Add benchmarking support of NV Fuser with FP16 dtype support (#44101)