llama.cpp
Refactor/online repacking
#10446
Merged

Refactor/online repacking #10446

Djip007
github-actions github-actions added ggml
Djip007
Djip007 commented on 2024-11-21
Djip007
Djip007 commented on 2024-11-21
Djip007
Djip007 commented on 2024-11-21
Djip007 Djip007 force pushed 1 year ago
slaren
Djip007
Djip007
ggerganov
Djip007
slaren
Djip007
slaren
Djip007 Djip007 force pushed 1 year ago
github-actions github-actions added Nvidia GPU
github-actions github-actions added SYCL
Djip007 Djip007 force pushed 1 year ago
Djip007 Djip007 force pushed 1 year ago
Djip007 Djip007 force pushed 1 year ago
Djip007
Djip007 commented on 2024-11-29
Djip007 Djip007 force pushed 1 year ago
Djip007
Djip007 Djip007 force pushed 1 year ago
Djip007
Djip007 Djip007 force pushed 1 year ago
Djip007 Djip007 force pushed 1 year ago
Djip007
Djip007 Djip007 marked this pull request as ready for review 1 year ago
slaren
slaren commented on 2024-12-02
Djip007
Djip007 Djip007 force pushed 1 year ago
Djip007 Djip007 force pushed 1 year ago
Djip007 Djip007 force pushed 1 year ago
Djip007
slaren
Djip007 Djip007 force pushed 1 year ago
Djip007
slaren
slaren
slaren commented on 2024-12-04
Djip007
Djip007
Djip007
slaren
Djip007 Djip007 force pushed 1 year ago
github-actions github-actions added documentation
github-actions github-actions added examples
github-actions github-actions added python
Djip007 Djip007 force pushed 1 year ago
Djip007
Djip007 rename ggml-cpu-aarch64.c to .cpp
9ac05c11
Djip007 reformat extra cpu backend.
98ea414f
Djip007 clang-format
95322e93
Djip007 Clean Q4_0_N_M ref
3a042b48
Djip007 add op GGML_OP_MUL_MAT_ID for Q4_0_N_M with runtime repack
0a2be72d
slaren
slaren approved these changes on 2024-12-06
slaren slaren requested a review from ggerganov ggerganov 1 year ago
ggerganov
ggerganov approved these changes on 2024-12-06
Djip007
Djip007 commented on 2024-12-06
Djip007 added/corrected control on tensor size for Q4 repacking.
b14b4713
Djip007 Djip007 force pushed to b14b4713 1 year ago
Djip007
ggerganov
ggerganov commented on 2024-12-07
ggerganov
Djip007 Update ggml/src/ggml-cpu/ggml-cpu-aarch64.cpp
7dc8a3e2
Djip007 Update ggml/src/ggml-cpu/ggml-cpu-aarch64.cpp
e115f6f6
Djip007 add debug logs on repacks.
1221d13d
Djip007 Djip007 force pushed to 1221d13d 1 year ago
Djip007
ggerganov
Djip007
ggerganov ggerganov merged 19d8762a into master 1 year ago
Djip007
bartowski1182
slaren

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone