llama.cpp
More optimizations on metal
#2959
Merged
ggerganov merged 8 commits into master from ik/more_metal_optimizations
2cb47e0e Very minor speedup via simd-group synchronization in f16 x f32
e3ff8c20 Another very minor speedup on metal
2b601702 Quite significant PP speedup on metal
b557bc32 Another attempt
74df0de9 Minor
ikawrakow requested a review from ggerganov 2 years ago
01eed465 Merge branch 'master' into ik/more_metal_optimizations
363f0bf5 Massive improvement for TG for fp16
6af0bab3 ~4-5% improvement for Q8_0 TG on metal
ggerganov approved these changes on 2023-09-03
ggerganov merged ca82cf7b into master 2 years ago
ggerganov commented on 2023-09-03
ikawrakow deleted the ik/more_metal_optimizations branch 1 year ago
Reviewers: ggerganov
Assignees: No one assigned
Labels: None yet
Milestone: No milestone