llama.cpp
CUDA: MoE helper in device code, better tile sizes
#15525
Merged

CUDA: MoE helper in device code, better tile sizes #15525

JohannesGaessler
JohannesGaessler CUDA: MoE helper in device code, better tile sizes
8bb55de6
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
JohannesGaessler reduce superfluous CUDA blocks
07c814b5
JohannesGaessler
IMbackK
IMbackK requested changes on 2025-08-23
JohannesGaessler try AMD fix
e7b884da
JohannesGaessler
IMbackK
IMbackK
IMbackK
IMbackK
IMbackK approved these changes on 2025-08-23
JohannesGaessler raise shared memory limit
57249900
JohannesGaessler
JohannesGaessler reduce shared memory use
a2f702a9
JohannesGaessler
JohannesGaessler add assert
1d609235
JohannesGaessler JohannesGaessler merged 5eff6ec9 into master 140 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone