llama.cpp
ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs
#13107

Merged

slaren merged 3 commits into ggml-org:master from SongXiaoXi:master

github-actions added the ggml label

SongXiaoXi force pushed 1 year ago

ggml: dynamic x86_64 feature detection for FP32 <-> FP16/BF16 conversion

c5e3b52c

SongXiaoXi force pushed to c5e3b52c 1 year ago

move fp converter to ggml-cpu

3efb0e73

SongXiaoXi changed the title from "ggml: dynamic x86_64 feature detection for FP32 <-> FP16/BF16 conversion" to "ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs" 1 year ago

slaren approved these changes on 2025-04-26

Switch ggml_compute_forward_get_rows_f16/bf16 to new ggml_cpu_fp16/bf…

82f8630a

slaren merged 77d5e9a7 into master 1 year ago

Reviewers

slaren

Assignees

No one assigned

Labels

ggml

Milestone

No milestone
