llama.cpp
Leverage the existing GGML_F32_VEC helpers to vectorize ggml_vec_set_f32 for faster fills
#16522

Merged

Leverage the existing GGML_F32_VEC helpers to vectorize ggml_vec_set_f32 for faster fills #16522

slaren merged 3 commits into ggml-org:master from sirus20x6:vectorize-vec-set-f32

Leverage the existing GGML_F32_VEC helpers to broadcast the fill valu…

33087594

sirus20x6 requested a review from

ggerganov 169 days ago

sirus20x6 requested a review from

slaren 169 days ago

github-actions added ggml

sirus20x6 marked this pull request as draft 169 days ago

Vectorize additional f32 helper loops

dff11736

sirus20x6 marked this pull request as ready for review 169 days ago

ggerganov commented on 2025-10-13

Normalize f32 helper tails for ggml vec ops

e4189f75

ggerganov approved these changes on 2025-10-14

slaren approved these changes on 2025-10-22

slaren merged 19a5a3ed into master 158 days ago

Reviewers

slaren

ggerganov

Assignees

No one assigned

Labels

ggml

Milestone

No milestone