SemanticDiff

pytorch
3d7c9abb - Refactor thread_reduce for better unrolling and vectorization in the future (#36014)

Commit View On GitHub

Login via GitHub
Home
Pricing
FAQ
Install

Login via GitHub

Commit

4 years ago

Refactor thread_reduce for better unrolling and vectorization in the future (#36014) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/36014 Benchmark on RTX2080Ti: 2.13ms vs 1.88ms https://github.com/zasdfgbnm/things/blob/master/2020Q2/reduction-benchmark-refactor.ipynb Test Plan: Imported from OSS Differential Revision: D20927535 Pulled By: ngimel fbshipit-source-id: b65b749b58cebe0751e4ec7e1cf359543c401580

Author

zasdfgbnm

zasdfgbnm

Committer

facebook-github-bot

facebook-github-bot

Parents

FAQ Terms Privacy Refunds Impressum

Loading