llama.cpp
CUDA: fix partial offloading for ne0 % 256 != 0
#8572
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
1
Changes
View On
GitHub
CUDA: fix partial offloading for ne0 % 256 != 0
#8572
JohannesGaessler
merged 1 commit into
ggml-org:master
from
JohannesGaessler:cuda-glm4-fix
JohannesGaessler
force pushed
1 year ago
github-actions
added
ggml
JohannesGaessler
force pushed
1 year ago
slaren
commented on 2024-07-18
JohannesGaessler
force pushed
to
992d7c41
1 year ago
CUDA: fix partial offloading for ne0 % 256 != 0
8784fcd5
JohannesGaessler
force pushed
from
992d7c41
to
8784fcd5
1 year ago
slaren
approved these changes on 2024-07-18
JohannesGaessler
merged
a15ef8f8
into master
1 year ago
forworldm
commented on 2024-08-02
Login to write a write a comment.
Login via GitHub
Reviewers
slaren
forworldm
Assignees
No one assigned
Labels
ggml
Milestone
No milestone
Login to write a write a comment.
Login via GitHub