onnxruntime
cd0af7ad - Symmetric quantized convolution kernel ARM64 (#9772)

Commit
4 years ago
Symmetric quantized convolution kernel ARM64 (#9772) Adding a symmetric quantized convolution kernel for ARM64 Note: Indirect conv performs worse for shallow convs (input channels are small). This is much more so for low end pre-dot CPUs, where only 128 or deeper conv is faster with indirect conv. With DOT-CPUs, 32 deep conv is already faster Co-authored-by: Chen Fu <fuchen@microsoft.com>
Author
Parents
Loading