onnxruntime
cbdd0bb7 - QAttention calls into MatMulIntToFloat instead of Dequantize+GEMM (#16851)

Commit
2 years ago
QAttention calls into MatMulIntToFloat instead of Dequantize+GEMM (#16851) ### Description Update QAttention calling into MatMulIntToFloat instead of Dequantize+GEMM to enable more metacommand path.
Parents
Loading