optimum
2155fbe6 - Enable inference with a merged decoder in `ORTModelForCausalLM` (#647)

Commit
3 years ago
Enable inference with a merged decoder in `ORTModelForCausalLM` (#647) * Add save option * Add test for saving * Fix test path * Allow str path for merging * Add Path and remove merged names * Finish adapting ORTModelDecoder * Prepare extra inputs * do not store merged in place * Support I/O binding for merged * Extend to multiple patterns * Add test for inference * Fix test * update test * Remove prints * add back missing method * fix mess * fix post merge * none defaut * fix * fix errors * remove nonsense tests * fix doc * ongoing * debug * fix style * fix post merge * hopefully working! * add tests * rename * add constants * fix test * fix names * ort support * wip * fix * fix * tests for merged * stype * fix * fix merge errors * fix tests * fix test * remove irrelevant test * Update optimum/exporters/onnx/__main__.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * Update optimum/exporters/onnx/base.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * Update optimum/exporters/onnx/config.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * Update optimum/exporters/onnx/config.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * Update optimum/exporters/onnx/config.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * Update optimum/exporters/onnx/config.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * Update optimum/exporters/onnx/config.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * Update optimum/onnxruntime/modeling_decoder.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * Update tests/exporters/onnx/test_exporters_onnx_cli.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * fix on suggestions * fix import of dummyinputgenerators * skip unwanted tests * fix diffusion model * fix tests * Update optimum/commands/export/onnx.py Co-authored-by: Jingya HUANG <44135271+JingyaHuang@users.noreply.github.com> * Update optimum/onnxruntime/modeling_decoder.py Co-authored-by: Jingya HUANG <44135271+JingyaHuang@users.noreply.github.com> * fix last tests * Update optimum/onnx/graph_transformations.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * Update optimum/onnx/graph_transformations.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * Update optimum/onnx/graph_transformations.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * Update optimum/onnx/graph_transformations.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * Update optimum/onnxruntime/modeling_decoder.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * fix signature and docstrings * add error message if post process fails * tiny fix * last fixes * typo --------- Co-authored-by: Félix Marty <9808326+fxmarty@users.noreply.github.com> Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>
Author
Parents
Loading