vulkan: Update topk_moe fusion to handle gpt's late softmax #16656
0cc4m
commented
on 2025-10-25
vulkan: Update topk_moe fusion to handle gpt's late softmax
6cccaef3
Add ggml_check_edges
81853b56
Add sync logging to show fusion effects
180eef4d
handle clamp added in #16655
b046c734
0cc4m
approved these changes
on 2025-10-29
slaren
approved these changes
on 2025-10-29
Update ggml/src/ggml-impl.h
832ea836
0cc4m
merged
10fcc412
into master 50 days ago
Assignees
No one assigned
Labels
testing
Vulkan
ggml
Login to write a write a comment.
Login via GitHub