onnxruntime
b0f8ec7a - Script for converting BERT model for performance optimization (#2037)

Commit
6 years ago
Script for converting BERT model for performance optimization (#2037)

* Script for converting ONNX model for BERT performance optimization
* Remove code that is no longer needed.
* Refine the script
* Support BERT model exported from PyTorch 1.3
  Keep opset version
  Exact match in Attention, Layer normalization fusions.
* Read batch_size from input model directly