pytorch
aa0ca994 - [Inductor] add missing ops for cpp vectorization overrides (#90750)

Commit

1 year ago

[Inductor] add missing ops for cpp vectorization overrides (#90750) For micro-benchmark, aten.elu.default and aten.elu_backward.default have poor performance with inductor compared to eager. The main reason is lack of the vectorization. With adding missing ops for cpp vectorization overrides, the vectorization could be successfully applied. Performance data for eager v.s. inductor: <html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:x="urn:schemas-microsoft-com:office:excel" xmlns="http://www.w3.org/TR/REC-html40"> <head> <meta name=ProgId content=Excel.Sheet> <meta name=Generator content="Microsoft Excel 15"> <link id=Main-File rel=Main-File href="file:///C:/Users/xuanliao/AppData/Local/Temp/msohtmlclip1/01/clip.htm"> <link rel=File-List href="file:///C:/Users/xuanliao/AppData/Local/Temp/msohtmlclip1/01/clip_filelist.xml">  </head> <body link="#0563C1" vlink="#954F72"> op | speedup_old | RSD (3) | speedup_new | RSD (3) | increased_performance -- | -- | -- | -- | -- | -- aten.elu.default | 0.205947276 | 1.73% | 0.995302802 | 4.76% | 383.28% aten.elu_backward.default | 0.336280639 | 0.58% | 1.69473642 | 1.96% | 403.96% </body> </html> The new supported ops for cpp vectorization overrides: - eq - ne - lt - gt - le - ge Pull Request resolved: https://github.com/pytorch/pytorch/pull/90750 Approved by: https://github.com/jgong5, https://github.com/EikanWang, https://github.com/jansel, https://github.com/desertfire

Author

Valentine233

Committer

pytorchmergebot

Parents

1d2bfea3

pytorch aa0ca994 - [Inductor] add missing ops for cpp vectorization overrides (#90750)

pytorch
aa0ca994 - [Inductor] add missing ops for cpp vectorization overrides (#90750)