llama.cpp
vulkan: Add fusion support for RMS_NORM+MUL
#14366
Merged

vulkan: Add fusion support for RMS_NORM+MUL #14366

0cc4m merged 12 commits into ggml-org:master from jeffbolznv:rms_mul_fusion_2
jeffbolznv
jeffbolznv jeffbolznv requested a review from slaren slaren 84 days ago
github-actions github-actions added testing
github-actions github-actions added Vulkan
github-actions github-actions added ggml
jeffbolznv jeffbolznv marked this pull request as draft 84 days ago
jeffbolznv
jeffbolznv commented on 2025-06-24
jeffbolznv
jeffbolznv vulkan: Add fusion support for RMS_NORM+MUL
8643fea3
jeffbolznv extract some common fusion logic
18f2d000
jeffbolznv fix -Winconsistent-missing-override
b84cb4ac
jeffbolznv move ggml_can_fuse to a common function
5e13dcf7
jeffbolznv jeffbolznv force pushed from c2245299 to 5e13dcf7 82 days ago
jeffbolznv
jeffbolznv commented on 2025-06-26
jeffbolznv build fix
e6f3c067
jeffbolznv C and C++ versions of can_fuse
da2fce0b
slaren
jeffbolznv
jeffbolznv move use count to the graph to avoid data races and double increments…
8c50a9b4
slaren
slaren commented on 2025-06-27
jeffbolznv use hash table lookup to find node index
9ddd4252
jeffbolznv change use_counts to be indexed by hash table slot
16304359
slaren minimize hash lookups
2dbac5a0
slaren
slaren approved these changes on 2025-06-28
ggerganov
ggerganov approved these changes on 2025-06-28
jeffbolznv last node doesn't need single use.
6894905c
jeffbolznv jeffbolznv marked this pull request as ready for review 80 days ago
jeffbolznv jeffbolznv requested a review from 0cc4m 0cc4m 80 days ago
0cc4m
0cc4m approved these changes on 2025-06-28
jeffbolznv remove redundant parameter
a75e9837
0cc4m 0cc4m merged bd9c981d into master 79 days ago
CISC
0cc4m
CISC
CISC
CISC
jeffbolznv
CISC
CISC
MaggotHATE
0cc4m
MaggotHATE
jeffbolznv
MaggotHATE
jeffbolznv
MaggotHATE
MaggotHATE
0cc4m
MaggotHATE
0cc4m
MaggotHATE

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone