optimum
a31e59ed - Add ORT inference (#113)

Commit

3 years ago

Add ORT inference (#113)

* added gpu extras and added > transformers for token-classification pipeline issue
* added numpy and huggingface hub to required packages
* added modeling_* classes
* adding tests and pipelines
* remove vs code folder
* added test model and adjusted gitignore
* add readme for tests
* working tests
* added some documentation
* will ci run?
* added real model checkpoints
* test ci
* fix styling
* fix some documentation
* more doc fixes
* added some feedback and wording from michael and lewis
* renamed model class to ORTModelForXX
* moved from_transformers to from_pretrained
* applied ellas feedback
* make style
* first version of ORTModelForCausalLM without past-keys
* added first draft of new .optimize method
* added better quantize method
* fix import
* remove optimize and quantize
* added lewis feedback
* added style for test
* added >>> to code snippets
* style
* added condition for staging tests
* feedback morgan & michael
* added action
* forgot to install pytest
* forgot sentence piece
* made sure we won't have import conflicts
* make style happy

References

#113 - Add ORT inference

Author

philschmid

Parents

74172026

Files (20)

.github/workflows
- test_modeling_ort.yml
.gitignore
docs/source
- _toctree.yml
- onnxruntime
  - modeling_ort.mdx
- pipelines.mdx
- quickstart.mdx
optimum
- modeling_base.py
- onnxruntime
  - __init__.py
  - modeling_ort.py
  - utils.py
- pipelines.py
- utils
  - __init__.py
  - testing_utils.py
setup.py
tests
- README.md
- assets
  - hub
    - config.json
  - onnx
    - config.json
    - model.onnx
- onnxruntime
  - test_modeling_ort.py
- test_modeling_base.py
