llama.cpp
Q4/Q8 Tiled Gemm Optimization.
#16999
Merged

Q4/Q8 Tiled Gemm Optimization. #16999

taronaeo merged 3 commits into ggml-org:master from shalinib-ibm:q8_q4_opt
shalinib-ibm
shalinib-ibm Q4/Q8 Tiled Gemm Optimization.
52fb79b9
shalinib-ibm shalinib-ibm requested a review from ggerganov ggerganov 102 days ago
shalinib-ibm shalinib-ibm requested a review from slaren slaren 102 days ago
shalinib-ibm
github-actions github-actions added ggml
shalinib-ibm
ggerganov
ggerganov commented on 2025-11-05
shalinib-ibm shalinib-ibm requested a review from ggerganov ggerganov 99 days ago
shalinib-ibm shalinib-ibm force pushed from b663ced4 to 6cdfffd6 74 days ago
shalinib-ibm shalinib-ibm force pushed from 6cdfffd6 to 2c1171b3 74 days ago
shalinib-ibm shalinib-ibm force pushed from 2c1171b3 to 03539089 74 days ago
shalinib-ibm shalinib-ibm force pushed from 03539089 to c33dffb9 74 days ago
shalinib-ibm
shalinib-ibm
taronaeo
shalinib-ibm shalinib-ibm force pushed from c33dffb9 to 88a9f0b4 73 days ago
shalinib-ibm
shalinib-ibm shalinib-ibm force pushed from 88a9f0b4 to 38077cda 73 days ago
ggerganov
ggerganov commented on 2025-12-03
shalinib-ibm Remove dymanic memory allocation during rutime
f72387fb
shalinib-ibm shalinib-ibm force pushed from 38077cda to f72387fb 73 days ago
ggerganov
ggerganov approved these changes on 2025-12-03
shalinib-ibm
taronaeo
taronaeo approved these changes on 2025-12-04
shalinib-ibm Update ggml/src/ggml-cpu/llamafile/sgemm-ppc.h
d7759323
shalinib-ibm
taronaeo
taronaeo taronaeo merged 3a0d1053 into master 71 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone