llama.cpp
ggml: add ops for WAN video model (cuda && cpu)
#15669
Merged

ggml: add ops for WAN video model (cuda && cpu) #15669

JohannesGaessler merged 31 commits into ggml-org:master from leejet:wan
leejet
leejet add conv3d support
c92f9b4a
leejet add ggml_pad_ext for cpu & cuda backend
93c7e775
leejet cuda/cpu: add im2col_3d support
f7a12f9e
leejet cuda: make im2col a little faster
85c8e1e5
leejet fix cuda pad/scale/im2col3d
ae47caca
leejet make im2col_3d faster
dd745ba3
leejet gguf: support loading tensors which n_dims > GGML_MAX_DIMS
d8377a0a
leejet fix cuda get_rows
d30e07db
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
github-actions github-actions added Ascend NPU
leejet avoid ggml_conv_3d conflict
df05913b
github-actions github-actions added testing
leejet correct GGML_OP_COUNT assertion
9d035c4c
leejet avoid build failure
d11a7298
leejet avoid build failure on MacOS
f6a874c0
leejet cuda: remove unnecessary MIN define
f6278c83
jeffbolznv
leejet
jeffbolznv
leejet fix cpu im2col_3d
c9b9fabe
leejet
Acly
Acly commented on 2025-08-30
JohannesGaessler
JohannesGaessler commented on 2025-08-30
JohannesGaessler
JohannesGaessler commented on 2025-08-30
leejet adjust the code style
131ae2d5
leejet cuda: use simpler loop in get_rows
0d5eb512
leejet add test_im2col_3d to test-backend-ops
aafa79ae
leejet
leejet test-backend-ops.cpp: remove trailing whitespace
3f901e31
jeffbolznv
jeffbolznv
jeffbolznv
jeffbolznv
jeffbolznv commented on 2025-08-31
leejet
leejet
jeffbolznv
leejet cpu: im2col_3d support non continuous src
e66bf6e5
leejet fix test_im2col_3d
b4c50bec
leejet remove unused variables
8f5e7b0c
leejet cuda: get_rows: dfloat2 -> float2
21e93380
JohannesGaessler
JohannesGaessler
JohannesGaessler commented on 2025-08-31
leejet
leejet add test_pad_ext to test-backend-ops.cpp
36f2215e
JohannesGaessler
JohannesGaessler commented on 2025-09-02
jeffbolznv
leejet add gguf_init_from_file_ext impl
d9f1d132
leejet
leejet Merge branch 'master' into wan
6b71242a
JohannesGaessler
leejet Revert "gguf: support loading tensors which n_dims > GGML_MAX_DIMS"
9b365e83
leejet Revert "add gguf_init_from_file_ext impl"
6b6eeded
leejet
CISC
JohannesGaessler
pwilkin
leejet
JohannesGaessler
leejet update ggml_backend_vk_device_supports_op
2412bb0b
leejet leejet requested a review from 0cc4m 0cc4m 106 days ago
github-actions github-actions added Vulkan
leejet fix ggml_backend_vk_device_supports_op
b38bfbb5
leejet update other backend supports op for ggml_pad_ext
457f1864
0cc4m
github-actions github-actions added SYCL
github-actions github-actions added Apple Metal
github-actions github-actions added OpenCL
leejet
leejet metal/opencl/sycl/vulkan: fix GGML_OP_PAD check in supports_op
1618844e
leejet
JohannesGaessler
JohannesGaessler approved these changes on 2025-09-04
JohannesGaessler JohannesGaessler merged 0a1b3982 into master 105 days ago
leejet

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone