llama.cpp
CUDA: stream-k decomposition for MMQ
#8018
Merged
JohannesGaessler merged 2 commits into ggml-org:master from JohannesGaessler:cuda-mmq-stream-k-2
CUDA: stream-k decomposition for MMQ
da1db13d
github-actions added Nvidia GPU
github-actions added ggml
JohannesGaessler added Review Complexity : High
fix undefined memory reads for small matrices
141d0810
slaren
approved these changes on 2024-06-20
JohannesGaessler merged d50f8897 into master 1 year ago
Reviewers
slaren
Assignees
No one assigned
Labels
Nvidia GPU
Review Complexity : High
ggml
Milestone
No milestone