pytorch
2cb7c3f8 - [dynamo][benchmarks] Prepone Cold start setup (#87913)

Commit

2 years ago

[dynamo][benchmarks] Prepone Cold start setup (#87913) Parallel compilation warms the Threadpool when we call `torch._dynamo.optimize()`. In current benchmarks, we were setting up the TRITON_CACHE_DIR much later. Because of this parallel compilation artifacts were not used and compilation latency improvements were not visible in dashboard. This PR just prepones the setup of TRITON_CACHE_DIR. cc @jansel @mlazos @soumith @voznesenskym @yanboliang @penguinwu @EikanWang @jgong5 @Guobing-Chen @chunyuan-w @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx Pull Request resolved: https://github.com/pytorch/pytorch/pull/87913 Approved by: https://github.com/wconstab

Author

anijain2305

Committer

pytorchmergebot

Parents

641d8e0e

pytorch 2cb7c3f8 - [dynamo][benchmarks] Prepone Cold start setup (#87913)

pytorch
2cb7c3f8 - [dynamo][benchmarks] Prepone Cold start setup (#87913)