text-generation-inference
Update the llamacpp backend
#3022
Merged

angt merged 17 commits into huggingface:main from angt:llamacpp-update
angt requested a review from mfuntowicz 1 year ago
angt requested a review from Hugoch 1 year ago
angt requested a review from fgbelidji 1 year ago
mfuntowicz dismissed these changes on 2025-02-14
Hugoch commented on 2025-02-14
angt dismissed their stale review via 9714f015 1 year ago
Narsil commented on 2025-02-18
Narsil commented on 2025-02-18
Narsil dismissed these changes on 2025-02-18
fgbelidji commented on 2025-02-18
angt dismissed their stale review via eeff235c 1 year ago
fgbelidji approved these changes on 2025-02-19
angt marked this pull request as ready for review 1 year ago
angt force pushed from 7461a89a to 0e681c79 1 year ago
angt force pushed from 8adb9f20 to e9d18b07 362 days ago
angt Build faster
bda39e42
angt Make --model-gguf optional
2d4aa25b
angt Bump llama.cpp
46bc8e6b
angt Enable mmap, offload_kqv & flash_attention by default
30cd3cf5
angt Update doc
2242d1a6
angt Better error message
0d01a89f
angt Update doc
7388468e
angt Update installed packages
961a133d
angt Save gguf in models/MODEL_ID/model.gguf
d41183a0
angt Fix build with Mach-O
6223b6e2
angt Quantize without llama-quantize
0a55bd3d
angt Bump llama.cpp and switch to ggml-org
38492233
angt Remove make-gguf.sh
46feaf62
angt Update Cargo.lock
aadd6249
angt Support HF_HUB_USER_AGENT_ORIGIN
8fe85120
angt Bump llama.cpp
8a79cfd0
angt force pushed from e9d18b07 to 8a79cfd0 362 days ago
angt Add --build-arg llamacpp_native & llamacpp_cpu_arm_arch
3f7369d1
mfuntowicz requested a review from mfuntowicz 356 days ago
mfuntowicz approved these changes on 2025-03-10
angt merged 094975c3 into main 356 days ago
