onnxruntime
90c5ffb5 - Integrate KleidiAI for MatMulNBits via MlasQNBitGemm (#23627)

Commit
285 days ago
Integrate KleidiAI for MatMulNBits via MlasQNBitGemm (#23627)

### Description
This PR integrates Arm® KleidiAI™ to provide optimized assembly kernels for matrix multiplication with 4-bit quantized weights. These changes target the MlasQNBitGemm functions, and can be utilized via the MatMulNBits operator.
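For readers unfamiliar with the term, the following is a minimal pure-Python sketch of what "matrix multiplication with 4-bit quantized weights" means numerically. It is an illustration only, not the MLAS or KleidiAI kernel and not the operator's actual memory layout. The helper names, the low-nibble-first packing, the symmetric per-block scale, and the fixed zero point of 8 are all assumptions made for this example. It also assumes the column length is a multiple of `block_size` and that `block_size` is even.

```python
# Illustrative only: blockwise 4-bit weight quantization followed by a
# reference matmul. All names and layout choices here are hypothetical.

ZERO_POINT = 8  # assumed midpoint of the unsigned 4-bit range [0, 15]


def quantize_block(values):
    """Quantize one block of floats to 4-bit codes sharing a single scale."""
    max_abs = max(abs(v) for v in values)
    scale = max_abs / 7.0 if max_abs > 0 else 1.0
    codes = [min(15, max(0, round(v / scale) + ZERO_POINT)) for v in values]
    return codes, scale


def pack_nibbles(codes):
    """Pack pairs of 4-bit codes into bytes, low nibble first (assumed order)."""
    return bytes(codes[i] | (codes[i + 1] << 4) for i in range(0, len(codes), 2))


def unpack_nibbles(packed):
    """Inverse of pack_nibbles: recover the list of 4-bit codes."""
    codes = []
    for byte in packed:
        codes.append(byte & 0x0F)
        codes.append(byte >> 4)
    return codes


def quantize_column(column, block_size):
    """Split one weight column into blocks of (packed bytes, scale)."""
    blocks = []
    for start in range(0, len(column), block_size):
        codes, scale = quantize_block(column[start:start + block_size])
        blocks.append((pack_nibbles(codes), scale))
    return blocks


def dequantize_column(blocks):
    """Rebuild an approximate float column from its quantized blocks."""
    column = []
    for packed, scale in blocks:
        column.extend((c - ZERO_POINT) * scale for c in unpack_nibbles(packed))
    return column


def matmul_4bit(a_rows, quantized_columns):
    """Reference A @ B where B is given as a list of quantized columns."""
    columns = [dequantize_column(blocks) for blocks in quantized_columns]
    return [[sum(x * w for x, w in zip(row, col)) for col in columns]
            for row in a_rows]


if __name__ == "__main__":
    weights = [[7.0, -7.0, 0.0, 3.0], [3.5, -3.5, 0.0, 1.5]]  # two columns, K = 4
    quantized = [quantize_column(col, block_size=4) for col in weights]
    print(matmul_4bit([[1.0, 2.0, 3.0, 4.0]], quantized))
```

This sketch dequantizes the weights to floats before multiplying, which keeps the arithmetic easy to follow. An optimized kernel would typically avoid materializing the full float matrix, but how the kernels in this PR do so is not described in the commit message above.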