pytorch
b14d6d73 - Reuse KernelSpec for FusionGroups with equivalent graphs (#14541)

Commit View On GitHub

Commit

5 years ago

Reuse KernelSpec for FusionGroups with equivalent graphs (#14541) Summary: Before this PR, loop unrolling + the graph fuser was creating multiple FusionGroups with the same bodies (with different variable names) for JIT LSTMs. Each FusionGroup got registered to a separate fusion key; each key resulted in a different compilation for the same specializations. This PR makes it so that when registering FusionGroups with the fusion compiler, the compiler first checks the KernelSpec cache to see if the FusionGroup's graph exists already. If it does, then return the corresponding KernelSpec's key to share compiled kernels. In addition, graphs in the KernelSpec cache are canonicalized before being cached. I added a flag to the canonicalize pass to remove unique names of values. This shortens the compile time for a JIT LSTM (seq_len of 100, loop unroll factor of 8) from 5.3s to 2.3s. Most of this compile time is running the graph fuser and/or fusion compiler; while this PR makes it so that there is only one unique kernel in the forward pass, there are a lot of different kernels (6) in the backward pass (after loop unrolling) that should be investigated. Pull Request resolved: https://github.com/pytorch/pytorch/pull/14541 Differential Revision: D13324487 Pulled By: zou3519 fbshipit-source-id: b841d82ed35a959b5cfc72db033bf5a7b42cc4fb

Author

zou3519

Committer

facebook-github-bot

Parents

aa022313

pytorch b14d6d73 - Reuse KernelSpec for FusionGroups with equivalent graphs (#14541)

Commit

pytorch
b14d6d73 - Reuse KernelSpec for FusionGroups with equivalent graphs (#14541)