llama.cpp
ggml-cpu: arm64: q4_K repack gemm and gemv implementations (i8mm)
#16739
Merged

ggml-cpu: arm64: q4_K repack gemm and gemv implementations (i8mm) #16739

Alcpz
Alcpz Alcpz requested a review from ggerganov ggerganov 110 days ago
Alcpz Alcpz requested a review from slaren slaren 110 days ago
github-actions github-actions added ggml
Alcpz Alcpz changed the title ggml-cpu: arm64: q4_K repack gemm and gemv implementations ggml-cpu: arm64: q4_K repack gemm and gemv implementations (i8mm) 106 days ago
ggerganov
ggerganov dismissed these changes on 2025-10-27
Alcpz
ggerganov
ggerganov ggerganov dismissed their stale review 102 days ago
https://github.com/ggml-org/llama.cpp/pull/16739#issuecomment-3472806066
Alcpz
ggerganov
Alcpz
Alcpz Enabled q4_K_8x8_q8_K path on ARM
b3011aa3
Alcpz wip: I8mm qs multiplication, pending bias
d00dbf4c
Alcpz cpu : arm : REPACK gemm q4_K8x8 implementation
b9b0b362
Alcpz Guard gemm with proper features, improved superblock scale and min calc
f956373e
Alcpz cpu: arm: Implemented REPACK gemv for Q4_K
eb8449b0
Alcpz Removed completed TODO
8df2511b
Alcpz Fixed missing guards when selecting optimal repack type for Q4_K
a66f6695
Alcpz Fixed macro guard for gemv
1d7738ea
Alcpz Fixed wrong comment in GEMV
b36356fd
Alcpz Fixed warning for unused variable
92f61ead
Alcpz vdotq_s32 -> ggml_vdotq_s32
b0b5a27c
Alcpz Clang-format issues
8a2fd934
Alcpz Alcpz force pushed to 8a2fd934 88 days ago
Alcpz
Alcpz Alcpz requested a review from ggerganov ggerganov 82 days ago
ggerganov
ggerganov approved these changes on 2025-11-20
Alcpz
ggerganov
Alcpz
slaren
slaren commented on 2025-11-20
Alcpz Apply suggestions from code review
a8a1d19d
Alcpz Removed unnecessary GGML_UNUSED
486c027e
Alcpz Fixed guards in q4_k gemm and gemv (repack)
96af4393
slaren
slaren approved these changes on 2025-11-24
ggerganov ggerganov merged dbb852b5 into master 78 days ago
Alcpz Alcpz deleted the Alcpz/arm_q4_k_repack branch 75 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone