onnxruntime
[LARCH64 CPU]Provide inference acceleration optimization for Loongson CPU with 4-bit quantized models
#26280
Merged

[LARCH64 CPU]Provide inference acceleration optimization for Loongson CPU with 4-bit quantized models #26280

movedancer
This submission includes five modifications, which are as follows:
2ce037c4
movedancer
snnn
snnn commented on 2025-10-14
movedancer movedancer requested a review from snnn snnn 255 days ago
snnn
movedancer
movedancer Update sqnbitgemm_kernel_lasx_common.h
b3407de5
movedancer Update sqnbitgemm_kernel_lasx.cpp
7f841963
movedancer
snnn
snnn commented on 2025-10-14
movedancer Update sqnbitgemm_kernel_lasx_common.h
4cae30ac
movedancer Update sqnbitgemm_kernel_lasx.cpp
a4c9b718
snnn snnn closed this 253 days ago
snnn snnn reopened this 253 days ago
snnn
azure-pipelines
movedancer Add MlasAlignedAllocator for aligned memory allocation
0a99f583
hariharans29
azure-pipelines
snnn
snnn approved these changes on 2025-10-17
snnn snnn merged 7cc28b0b into main 251 days ago
movedancer
movedancer movedancer deleted the add-more-loongson-support branch 251 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone