llama.cpp
ggml-cpu: enable IBM NNPA Vector Intrinsics
#14317
Merged

taronaeo
taronaeo ggml-cpu: add nnpa compile flag
5801806f
taronaeo ggml-cpu: add fp16->fp32 nnpa first
45a4cf65
taronaeo ggml-cpu: add fp32->fp16
ebf9f34a
taronaeo ggml-cpu: better variable names
ffe29645
taronaeo docs: update s390x docs
0394a006
taronaeo ggml-cpu: add debugging prints to see if dlf16 is correct
48b820d0
taronaeo ggml-cpu: fix print vs printf
d9cc63a9
taronaeo ggml-cpu: fix float placeholder
94f10ca1
taronaeo ggml-cpu: ensure fp16 and fp32 load and stores are called
8f3a5af6
taronaeo ggml-cpu: fp16 load ensured to hit
575ea9f6
taronaeo ggml-cpu: remove sigint from fp16 store
9330454c
taronaeo ggml-cpu: activate nnpa for ggml_cpu_fp16_to_fp32
ebc1d19f
taronaeo ggml-cpu: nnpa activate ggml_cpu_fp16_to_fp32 for 8 elements
6a25fd85
taronaeo ggml-cpu: nnpa switch to vec_xst test
f9f6c7e8
taronaeo ggml-cpu: switch to vec_xst for 4 element loops also
6d507bbe
taronaeo ggml-cpu: rework noop
8312adc9
taronaeo ggml-cpu: remove noop, general code cleanup
27b4c3f3
taronaeo ggml-cpu: clarify variable naming
e0f8fb93
taronaeo ggml-cpu: activate nnpa for ggml_cpu_fp32_to_fp16
bb9345ca
taronaeo ggml-cpu: add breakpoint for debugging
5424d9e7
taronaeo ggml-cpu: test fix for conversion failure
4f017d71
taronaeo ggml-cpu: disable fp32->fp16 nnpa conversions for now
27131e5f
taronaeo ggml-cpu: switch to elif macro
946c78eb
taronaeo ggml-cpu: reattempt fp32->fp16
433d5874
taronaeo ggml-cpu: fix typo
54811fc1
taronaeo ggml-cpu: reattempt fp32->fp16
e12e9fe7
taronaeo ggml-cpu: fix compiler types
7413dabc
taronaeo ggml-cpu: change to typedef vector types
373fa28e
taronaeo ggml-cpu: add 4 element loops for fp32->fp16
4621a23c
taronaeo ggml-cpu: clarified vector naming
987d1690
taronaeo ggml-cpu: bring back fp32->fp16 store nnpa
8ef51b90
taronaeo ggml-cpu: activate nnpa fp32->fp16 or fp16->fp32 compute
f1b1d98e
taronaeo ggml-cpu: add nnpa macro check in ggml-impl
1547ea23
taronaeo ggml-cpu: add missing __func__
0e571dd3
taronaeo ggml-cpu: diagnose why __NNPA__ macro is not being defined
4ad6efa3
taronaeo ggml-cpu: import vecintrin.h to fix compiler errors
81298380
taronaeo ggml-cpu: update macro tests
e7910fc9
taronaeo ggml-cpu: move s390x typedef to own header file
157f856c
taronaeo Revert "ggml-cpu: move s390x typedef to own header file"
48df9770
taronaeo ggml-cpu: switch to importing ggml-cpu-impl instead
3004a79f
taronaeo ggml-cpu: fix macro declaration
1cacdd9a
taronaeo ggml-cpu: test more macros
fadc1387
taronaeo ggml-cpu: add debug prints
ed76ff6e
taronaeo ggml-cpu: bruteforce macro definitions
84593387
taronaeo ggml-cpu: move macro definitions
72c91436
taronaeo ggml-cpu: add ggml-impl.h to cmakelists
a91c3ab6
taronaeo ggml-cpu: switch to private macros
ba3513e4
taronaeo ggml-cpu: move s390x typedef to own header file
18d79e1a
taronaeo ggml-cpu: move things around
781c2637
taronaeo ggml-cpu: bring back compile macros
263b820b
taronaeo ggml-cpu: switch to quotes for import
04a395ea
taronaeo ggml-cpu: add compiler error macro
c8b3b895
taronaeo ggml-cpu: add s390x detection in ggml-src
ebb8489a
taronaeo ggml-cpu: bring back compile definitions
3ec0bdc1
taronaeo ggml-cpu: undo cmakelists work
e43dc82a
taronaeo Revert "ggml-cpu: move s390x typedef to own header file"
5c9b0835
taronaeo ggml-cpu: remove typedefs.h
1b4dbf47
taronaeo ggml-cpu: remove typedef from cmakelists
46227c61
taronaeo ggml-cpu: add ggml-impl.h future notes
72965ea8
taronaeo ggml-cpu: add todo comment for future reference
07de57c6
github-actions added the documentation label
github-actions added the ggml label
taronaeo ggml-cpu: clarify naming of dlf16
489cdf44
taronaeo ggml-cpu: remove unnecessary target compile definitions
5004e439
taronaeo ggml-cpu: move nnpa fp16->fp32 and fp32->fp16 to simd-mappings
5834dee1
slaren commented on 2025-06-23
taronaeo ggml: refactor fp32->fp16 and fp16->fp32 simd to ggml-cpu
bd288e8f
taronaeo docs: update broken huggingface link for s390x
4d136cb6
taronaeo ggml-cpu: fix duplicate func names during compile
fbb73345
taronaeo Revert "ggml-cpu: fix duplicate func names during compile"
e73413bb
taronaeo Revert "ggml: refactor fp32->fp16 and fp16->fp32 simd to ggml-cpu"
8a5e011c
taronaeo ggml: refactor fp16<->fp32 simd to ggml-cpu
17b032fa
taronaeo ggml-cpu: fix missing simd-mappings.h import in quants.c
0367b803
taronaeo ggml-cpu: fix missing simd-mappings.h within repack
e615f73b
slaren commented on 2025-06-24
taronaeo ggml-cpu: fix amx mmq missing simd-mappings.h
3c055a42
taronaeo ggml-cpu: attempt at fixing loongarch failing build
e4666f93
taronaeo ggml-cpu: move nnpa together with other fp16<->fp32 simd
e4a7f84d
taronaeo ggml-cpu: fix wrong refactor of ggml-base
1e6ebb2b
taronaeo ggml: remove dependency on ggml-cpu from ggml-base
64568ffb
taronaeo requested a review from slaren 336 days ago
slaren commented on 2025-06-24
taronaeo ggml-cpu: rename all fp16<->fp32 macros to prefix with ggml_cpu
a02b360f
taronaeo ggml-cpu: remove mistaken fallback macro
1b23fec0
slaren requested a review from copilot-pull-request-reviewer 336 days ago
copilot-pull-request-reviewer commented on 2025-06-24
slaren commented on 2025-06-24
taronaeo ggml: move ggml_table_f32_f16 to ggml-cpu
9e40d984
taronaeo ggml-cpu: move ggml_table_f32_f16 back to ggml-base due to ci failures
32a35335
taronaeo Revert "ggml-cpu: move ggml_table_f32_f16 back to ggml-base due to ci…
827fce9c
taronaeo Revert "ggml: move ggml_table_f32_f16 to ggml-cpu"
5be39c11
taronaeo ggml: move ggml_table_f32_f16 to ggml-cpu
59b48e48
taronaeo ggml: move ggml_table_f32_f16 to ggml-cpu.c
6cebee25
slaren commented on 2025-06-25
slaren commented on 2025-06-25
taronaeo ggml-cpu: extern c ggml_table_f32_f16 + chore docs
5f2a09a8
taronaeo ggml-cpu: dedup ggml_table_f32_f16 from simd-mappings.h
f71b21d2
taronaeo Revert "ggml-cpu: dedup ggml_table_f32_f16 from simd-mappings.h"
176e1db6
taronaeo ggml-cpu: bring back ggml_table_f32_f16
2dce1191
taronaeo Revert "ggml-cpu: bring back ggml_table_f32_f16"
bb35ea67
slaren fix ggml time initialization
8efdc0b5
slaren fix f32_f16 table init
4ce16fa0
slaren commented on 2025-06-25
slaren remove extra line
97620ac1
slaren approved these changes on 2025-06-25
slaren merged 60ef23d6 into master 335 days ago