llama.cpp
CUDA: fix crash on large batch size for quant. MoE
#13537
Merged

JohannesGaessler
JohannesGaessler CUDA: fix crash on large batch size for quant. MoE
634be72d
github-actions added Nvidia GPU
github-actions added ggml
slaren commented on 2025-05-14
slaren approved these changes on 2025-05-14
jukofyork
JohannesGaessler merged 4696d567 into master 1 year ago