llama.cpp
Faster Q3_K implementation on Metal
#2307
Merged

ikawrakow merged 4 commits into master from ik/metal_faster_q3k
ikawrakow
Faster Q3_K on Metal (5bb23b5a)
Additional Q3_K speedup on Metal (8dba28c0)
Q3_K for QK_K = 64 (0099570f)
Better Q3_K for QK_K = 64 (d3c3624c)
ikawrakow requested a review from ggerganov 2 years ago
ggerganov approved these changes on 2023-07-21
ikawrakow merged 4d76a5f4 into master 2 years ago
ikawrakow deleted the ik/metal_faster_q3k branch 2 years ago