Turn on padding (#101915)
🚀 🚀 🚀
Turns on torchinductor mm padding. Gives 4% HF training win at 5s compilation time increase. Results for mm tuning are cached.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/101915
Approved by: https://github.com/jansel