llama.cpp
ggml-cuda: use universal launch bounds for MoE MMVQ kernel
#24547
Open

ggml-cuda: use universal launch bounds for MoE MMVQ kernel #24547

batot1
batot1 ggml-cuda: use universal launch bounds for MoE MMVQ kernel
8b153af4
batot1 batot1 requested a review 2 days ago
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone