onnxruntime
280b2634 - Prompt layer-wise recompute when applicable (#20126)

Commit

2 years ago

Prompt layer-wise recompute when applicable (#20126)

### Prompt layer-wise when applicable

When ONNX export fails and we find that the checkpoint function is used, give users an explicit prompt to enable layer-wise memory optimization.

- Use of the checkpoint function is a strong indicator that the model is too large to fit in GPU memory.
- If we don't override the checkpoint function here, ONNX export will fail in most cases:
  1. With older PyTorch versions, we just throw an exception when handling the gradient checkpoint feature.
  2. With newer PyTorch versions, an export failure happens.
- Neither failure told users explicitly how to mitigate the problem. This PR does.

![image](https://github.com/microsoft/onnxruntime/assets/10530022/c0476748-5818-4cc8-b2d6-88c7580fe4da)
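The override-and-prompt idea described in this commit message can be sketched roughly as follows. This is a minimal illustration only, not the code from the PR: `fake_framework`, `prompt_on_checkpoint`, `CheckpointDetectedError`, `export_model`, and the hint text are all hypothetical names, and a plain namespace stands in for the real checkpoint module so the sketch runs without PyTorch or ONNX Runtime installed.

```python
import types

# Stand-in for a framework's checkpoint module (hypothetical; not torch itself).
# The "checkpoint" here simply calls the wrapped function.
fake_framework = types.SimpleNamespace(
    checkpoint=lambda fn, *args, **kwargs: fn(*args, **kwargs)
)

# Hypothetical hint text; the real message in the PR may differ.
HINT = (
    "Gradient checkpointing was detected during export. "
    "Consider enabling layer-wise memory optimization instead."
)


class CheckpointDetectedError(RuntimeError):
    """Raised when the overridden checkpoint function is called."""


def _raise_with_hint(fn, *args, **kwargs):
    # Replacement for the checkpoint function: fail early with guidance.
    raise CheckpointDetectedError(HINT)


class prompt_on_checkpoint:
    """Context manager that temporarily overrides `module.checkpoint`."""

    def __init__(self, module):
        self._module = module
        self._original = None

    def __enter__(self):
        self._original = self._module.checkpoint
        self._module.checkpoint = _raise_with_hint
        return self

    def __exit__(self, exc_type, exc, tb):
        # Always restore the original function, even if export failed.
        self._module.checkpoint = self._original
        return False


def export_model(forward, x):
    """Hypothetical export step: run `forward` under the override."""
    with prompt_on_checkpoint(fake_framework):
        try:
            return ("ok", forward(x))
        except CheckpointDetectedError as err:
            return ("failed", str(err))


def plain_forward(x):
    return x * 2


def checkpointed_forward(x):
    # A model that relies on gradient checkpointing.
    return fake_framework.checkpoint(plain_forward, x)
```

A model that never calls the checkpoint function exports normally in this sketch, while one that does gets a failure carrying the mitigation hint rather than an opaque error.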

References

#20126 - Prompt layer-wise recompute when applicable

Author

pengwa

Parents

14d7872c
