llama.cpp
Faster Q3_K implementation on Metal #2307 (Merged)
ikawrakow merged 4 commits into master from ik/metal_faster_q3k
5bb23b5a Faster Q3_K on Metal
8dba28c0 Additional Q3_K speedup on Metal
0099570f Q3_K for QK_K = 64
d3c3624c Better Q3_K for QK_K = 64
ikawrakow requested a review from ggerganov 2 years ago
ggerganov approved these changes on 2023-07-21
ikawrakow merged 4d76a5f4 into master 2 years ago
ikawrakow deleted the ik/metal_faster_q3k branch 2 years ago
Reviewers: ggerganov
Assignees: No one assigned
Labels: None yet
Milestone: No milestone