llama.cpp
backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels
#9921
Merged

backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels #9921

chaxu01
github-actions github-actions added examples
github-actions github-actions added ggml
chaxu01 chaxu01 force pushed 1 year ago
slaren
chaxu01
slaren
chaxu01
slaren
chaxu01
chaxu01 chaxu01 force pushed 1 year ago
chaxu01
slaren
chaxu01 chaxu01 force pushed 1 year ago
chaxu01
slaren
chaxu01
slaren
chaxu01
slaren
chaxu01
slaren
slaren
chaxu01
chaxu01
slaren
chaxu01
chaxu01 backend-cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels
647eb316
chaxu01 refactor add new buffer type for online flow
b632bf0f
chaxu01 retain the tensor type as Q4_0
5947d72c
chaxu01 add check for tensor dimensions
871036d2
Srihari-mcw
chaxu01 chaxu01 force pushed to 871036d2 1 year ago
chaxu01
slaren
slaren
chaxu01 rebased onto commit a0a4646
76d89758
chaxu01
chaxu01 fix build error
2268ce0c
eddnjjn
slaren
slaren commented on 2024-11-08
slaren Update ggml/CMakeLists.txt
74d660ab
slaren Merge remote-tracking branch 'origin/master' into feature/online-flow
749a9e5e
slaren slaren force pushed to 749a9e5e 1 year ago
slaren
slaren approved these changes on 2024-11-14
slaren slaren merged 1607a5e5 into master 1 year ago
chaxu01
slaren
Srihari-mcw
slaren
Srihari-mcw
bartowski1182
slaren
Vali-98
ggerganov
Vali-98
LostRuins
doonny
chaxu01 chaxu01 deleted the feature/online-flow branch 115 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone