llama.cpp
ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs
#13107
Merged

slaren merged 3 commits into ggml-org:master from SongXiaoXi:master
github-actions added the ggml label
SongXiaoXi force pushed 1 year ago
SongXiaoXi ggml: dynamic x86_64 feature detection for FP32 <-> FP16/BF16 conversion
c5e3b52c
SongXiaoXi force pushed to c5e3b52c 1 year ago
SongXiaoXi move fp converter to ggml-cpu
3efb0e73
SongXiaoXi changed the title from "ggml: dynamic x86_64 feature detection for FP32 <-> FP16/BF16 conversion" to "ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs" 1 year ago
slaren approved these changes on 2025-04-26
SongXiaoXi Switch ggml_compute_forward_get_rows_f16/bf16 to new ggml_cpu_fp16/bf…
82f8630a
slaren merged 77d5e9a7 into master 1 year ago
