optimum
ae907fdb - Enable past_key_values for ORTModelForCausalLM (#326)

Commit
3 years ago
Enable past_key_values for ORTModelForCausalLM (#326)

* Add model with a causal language modeling head for ONNX Runtime inference
* Modify onnx config for consistency with transformers onnx
* Refactoring and debugging
* Add IOBinding
* Remove useless
* Delete previous and update doc
* Add test for with_past
* Further clean-up
* Fix causal modeling test
* Oops, fixed in wrong places
* Add use_io_binding to docstring
* Makes ORTModelDecoder parameters explicit
* Remove unused kwargs
* Makes ORTModelDecoder parameters explicit
* Add decoder filename as default value
* Fix test
* Fix test
* Fix docstring

Co-authored-by: JingyaHuang <huang_jingya@outlook.com>