llama.cpp
ggml-cpu: optimize ggml_vec_dot_bf16 for Power9
#18837
Merged

shalinib-ibm
shalinib-ibm requested a review from ggerganov 68 days ago
shalinib-ibm force-pushed from 92cac28b to 7cbfb64a 68 days ago
shalinib-ibm force-pushed from 7cbfb64a to e9cde100 68 days ago
shalinib-ibm ggml-cpu: optimize ggml_vec_dot_bf16 for Power9
cfd1ed22
shalinib-ibm force-pushed from e9cde100 to cfd1ed22 68 days ago
github-actions added the ggml label
taronaeo approved these changes on 2026-01-14
shalinib-ibm Update ggml/src/ggml-cpu/simd-mappings.h
0ca6fafa
shalinib-ibm Update simd-mappings.h
7b935c46
shalinib-ibm Update vec.cpp
d13f4c3d
shalinib-ibm Handle endianness in vec_mergeh/vec_mergel when converting BF16 to FP32
5abb3897
taronaeo approved these changes on 2026-01-15
shalinib-ibm Update ggml/src/ggml-cpu/simd-mappings.h
40a9fed2
shalinib-ibm Merge branch 'ggml-org:master' into vec_dot_bf16_opt
9d235ca0
taronaeo merged 8cc0ba95 into master 67 days ago
