onnxruntime
7cf9263e - Add changes for strided calibration (#20949)

Commit
1 year ago
Context and motivation: When quantizing large transformer models, we ran into out-of-memory (OOM) errors as the number of calibration samples increased. To resolve this, this PR adds support for reading quantization data in chunks, calculating ranges for intermediate tensors on each chunk, and then accumulating the results into the final ranges.
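The chunk-then-accumulate idea can be illustrated with a minimal, pure-Python sketch. It is not the code from this PR: the names `chunked`, `chunk_ranges`, `merge_ranges`, `strided_calibration`, and the `stride` parameter are hypothetical, each sample is assumed to be a dict of already-computed intermediate tensor values, and min/max is assumed as the range statistic.

```python
from itertools import islice
from typing import Dict, Iterable, Iterator, List, Tuple

# Hypothetical layout: tensor name -> flattened values for one calibration sample.
Sample = Dict[str, List[float]]
Ranges = Dict[str, Tuple[float, float]]


def chunked(samples: Iterable[Sample], stride: int) -> Iterator[List[Sample]]:
    """Yield successive lists of at most `stride` samples."""
    if stride <= 0:
        raise ValueError("stride must be positive")
    it = iter(samples)
    while True:
        chunk = list(islice(it, stride))
        if not chunk:
            return
        yield chunk


def merge_ranges(total: Ranges, partial: Ranges) -> Ranges:
    """Fold one set of (min, max) ranges into the running totals, in place."""
    for name, (lo, hi) in partial.items():
        if name in total:
            old_lo, old_hi = total[name]
            lo, hi = min(lo, old_lo), max(hi, old_hi)
        total[name] = (lo, hi)
    return total


def chunk_ranges(chunk: List[Sample]) -> Ranges:
    """Compute per-tensor (min, max) over the samples of a single chunk."""
    ranges: Ranges = {}
    for sample in chunk:
        per_sample = {name: (min(v), max(v)) for name, v in sample.items() if v}
        merge_ranges(ranges, per_sample)
    return ranges


def strided_calibration(samples: Iterable[Sample], stride: int) -> Ranges:
    """Accumulate final ranges chunk by chunk.

    Only one chunk of samples is held in memory at a time; after each chunk
    only the small per-tensor (min, max) summary is kept.
    """
    final: Ranges = {}
    for chunk in chunked(samples, stride):
        merge_ranges(final, chunk_ranges(chunk))
    return final
```

Because min and max are associative, the accumulated ranges do not depend on the chosen chunk size; a smaller `stride` only trades extra passes for a lower peak memory footprint. Other range statistics (for example percentile or entropy based ones) would need a different accumulation step.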