llama.cpp
Q4/Q8 Tiled Gemm Optimization.
#16999
Open

Q4/Q8 Tiled Gemm Optimization. #16999

shalinib-ibm wants to merge 1 commit into ggml-org:master from shalinib-ibm:q8_q4_opt
shalinib-ibm
shalinib-ibm Q4/Q8 Tiled Gemm Optimization.
52fb79b9
shalinib-ibm shalinib-ibm requested a review from ggerganov ggerganov 2 days ago
shalinib-ibm shalinib-ibm requested a review from slaren slaren 2 days ago
shalinib-ibm
github-actions github-actions added ggml
shalinib-ibm
ggerganov
ggerganov commented on 2025-11-05

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone