llama.cpp
CUDA: fix crash with partial offloading of MoE
#13439
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
1
Changes
View On
GitHub
CUDA: fix crash with partial offloading of MoE
#13439
JohannesGaessler
merged 1 commit into
ggml-org:master
from
JohannesGaessler:cuda-fix-partial-mmid
github-actions
added
Nvidia GPU
github-actions
added
ggml
JohannesGaessler
force pushed
328 days ago
CUDA: fix crash with partial offloading of MoE
4bc8f75d
JohannesGaessler
force pushed
to
4bc8f75d
328 days ago
slaren
approved these changes on 2025-05-11
JohannesGaessler
merged
7474e00b
into master
327 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
slaren
Assignees
No one assigned
Labels
Nvidia GPU
ggml
Milestone
No milestone
Login to write a write a comment.
Login via GitHub