llama.cpp
CUDA: fix crash on large batch size for MoE models
#13384

Merged

CUDA: fix crash on large batch size for MoE models #13384

JohannesGaessler merged 1 commit into ggml-org:master from JohannesGaessler:cuda-fix-moe-max-ub

CUDA: fix crash on large batch size for MoE models

819f6c59

github-actions added Nvidia GPU

github-actions added ggml

CISC approved these changes on 2025-05-09

JohannesGaessler merged 5c86c9ed into master 1 year ago

Reviewers

CISC

Assignees

No one assigned

Labels

Nvidia GPU ggml

Milestone

No milestone