pytorch
95fd1e90 - reduce number of randperm template instantiations (#58362)

Commit

3 years ago

reduce number of randperm template instantiations (#58362)

Summary:
Per title, benchmarks in https://github.com/pytorch/pytorch/issues/54113 don't regress, size of torch_cuda_cu_generated_Randperm.cu.o goes 8562152 -> 2585792 for a single architecture, compilation time decreases also.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/58362

Reviewed By: heitorschueroff

Differential Revision: D28477697

Pulled By: ngimel

fbshipit-source-id: 32dbe44ca6b3807668d548512d7484f8488834c4
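The commit message does not describe how the number of instantiations was reduced. One common way to get this kind of object-size saving, shown here only as a minimal host-side sketch and not as the code from this PR, is to instantiate the heavy routine once per element size instead of once per element type, since permuting data only moves bytes. The names `gather_bytes`, `gather_by_size`, and `apply_permutation` are hypothetical and do not come from the PyTorch source.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <type_traits>
#include <vector>

// Heavy routine: instantiated once per element size N (1, 2, 4, 8),
// not once per element type. It only copies N-byte blobs.
template <std::size_t N>
void gather_bytes(const unsigned char* src, unsigned char* dst,
                  const std::vector<std::int64_t>& perm) {
  for (std::size_t i = 0; i < perm.size(); ++i) {
    std::memcpy(dst + i * N,
                src + static_cast<std::size_t>(perm[i]) * N, N);
  }
}

// Runtime dispatch on element size. float and int32_t share the N = 4
// instantiation, double and int64_t share N = 8, and so on.
inline void gather_by_size(const void* src, void* dst, std::size_t elem_size,
                           const std::vector<std::int64_t>& perm) {
  const auto* in = static_cast<const unsigned char*>(src);
  auto* out = static_cast<unsigned char*>(dst);
  switch (elem_size) {
    case 1: gather_bytes<1>(in, out, perm); break;
    case 2: gather_bytes<2>(in, out, perm); break;
    case 4: gather_bytes<4>(in, out, perm); break;
    case 8: gather_bytes<8>(in, out, perm); break;
    default: assert(false && "unsupported element size");
  }
}

// Thin typed wrapper: still a template, but it contains almost no code,
// so instantiating it for many types stays cheap.
template <typename T>
std::vector<T> apply_permutation(const std::vector<T>& values,
                                 const std::vector<std::int64_t>& perm) {
  static_assert(std::is_trivially_copyable<T>::value,
                "byte-wise gather requires a trivially copyable type");
  std::vector<T> out(perm.size());
  gather_by_size(values.data(), out.data(), sizeof(T), perm);
  return out;
}
```

With this layout, calling `apply_permutation` for `float`, `std::int32_t`, `double`, and `std::int64_t` generates only two copies of the heavy loop (for 4 and 8 bytes) rather than four. The trade-off is that the routine can no longer rely on anything type-specific such as comparison or arithmetic on the elements.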

Author

Natalia Gimelshein

Committer

facebook-github-bot

Parents

a3b33139
