llama.cpp
metal : optimize MoE for large batches
#13388
Merged

ggerganov merged 1 commit into master from gg/metal-mm-id-opt
Commit b6a4d533 (ggerganov): metal : optimize MoE for large batches
github-actions added the ggml label
github-actions added the Apple Metal label
ggerganov merged 611aa914 into master 236 days ago
ggerganov deleted the gg/metal-mm-id-opt branch 236 days ago

Reviewers
No reviews
Assignees
No one assigned