onnxruntime
8b0e0f49 - Add RemovePadding and RestorePadding for BERT model (#13701)

Commit
3 years ago
Add RemovePadding and RestorePadding for BERT model (#13701) Add two operators RemovePadding and RestorePadding based on ideal of effective transformer (https://github.com/bytedance/effective_transformer) to improve large batch size inference for BERT model.
Author
Parents
Loading