llama.cpp
Vulkan Mixture of Experts (MoE) support
#7628
Merged


0cc4m merged 8 commits into master from 0cc4m/vulkan-moe
0cc4m
0cc4m Finish Vulkan mul_mat_id implementation
579f059a
0cc4m Add Vulkan sum_rows and div ops
b4abdbb8
0cc4m Fix MUL_MAT_ID matrix matrix shader
45928e8d
0cc4m Merge remote-tracking branch 'origin/master' into 0cc4m/vulkan-moe
8ecdda1e
github-actions added the Vulkan label
github-actions added the python label
0cc4m Fix MUL_MAT_ID matrix vector shader dispatch size
c8f93774
lin72h
MaggotHATE
mofosyne added the Review Complexity : High label
0cc4m
slaren
slaren
0cc4m Fix MUL_MAT_ID matrix vector shader and dispatch code
2c3d0b42
0cc4m Update Vulkan CPU offload for MUL_MAT_ID
6e0e0beb
0cc4m Fix crash when using split mode none and setting a main GPU
fe3f6958
0cc4m
MaggotHATE
slaren
slaren approved these changes on 2024-06-02
0cc4m merged 3d7ebf63 into master 1 year ago
0cc4m deleted the 0cc4m/vulkan-moe branch 1 year ago
github-actions

