onnxruntime
[MLAS] add q4 quantize and transpose kernel to support MatMulNBits QDQ fuse
#21054
Merged
