llama.cpp
Refactor/online repacking
#10446
Merged

Refactor/online repacking #10446

Djip007
github-actions github-actions added ggml
Djip007
Djip007 commented on 2024-11-21
Djip007
Djip007 commented on 2024-11-21
Djip007
Djip007 commented on 2024-11-21
Djip007 Djip007 force pushed from 36a0406f to 655a3fbd 297 days ago
slaren
Djip007
Djip007
ggerganov
Djip007
slaren
Djip007
slaren
Djip007 Djip007 force pushed from 655a3fbd to e772df4f 290 days ago
github-actions github-actions added Nvidia GPU
github-actions github-actions added SYCL
Djip007 Djip007 force pushed from e772df4f to 00e6a3f2 290 days ago
Djip007 Djip007 force pushed from 00e6a3f2 to a411d958 290 days ago
Djip007 Djip007 force pushed from a411d958 to fd768e04 290 days ago
Djip007
Djip007 commented on 2024-11-29
Djip007 Djip007 force pushed from fd768e04 to 16154eb7 290 days ago
Djip007
Djip007 Djip007 force pushed from 16154eb7 to dc8adeb1 290 days ago
Djip007
Djip007 Djip007 force pushed from dc8adeb1 to 1b29245e 287 days ago
Djip007 Djip007 force pushed from 1b29245e to 733f8916 287 days ago
Djip007
Djip007 Djip007 marked this pull request as ready for review 287 days ago
slaren
slaren commented on 2024-12-02
Djip007
Djip007 Djip007 force pushed from 733f8916 to e8a75e60 286 days ago
Djip007 Djip007 force pushed from e8a75e60 to 66be52c9 286 days ago
Djip007 Djip007 force pushed from 66be52c9 to 10df7d09 286 days ago
Djip007
slaren
Djip007 Djip007 force pushed from cb557b45 to ccdc7709 286 days ago
Djip007
slaren
slaren
slaren commented on 2024-12-04
Djip007
Djip007
Djip007
slaren
Djip007 Djip007 force pushed from f9e92183 to dc5def81 284 days ago
github-actions github-actions added documentation
github-actions github-actions added examples
github-actions github-actions added python
Djip007 Djip007 force pushed from dc5def81 to c5aa5b9c 284 days ago
Djip007
Djip007 rename ggml-cpu-aarch64.c to .cpp
9ac05c11
Djip007 reformat extra cpu backend.
98ea414f
Djip007 clang-format
95322e93
Djip007 Clean Q4_0_N_M ref
3a042b48
Djip007 add op GGML_OP_MUL_MAT_ID for Q4_0_N_M with runtime repack
0a2be72d
slaren
slaren approved these changes on 2024-12-06
slaren slaren requested a review from ggerganov ggerganov 283 days ago
ggerganov
ggerganov approved these changes on 2024-12-06
Djip007
Djip007 commented on 2024-12-06
Djip007 added/corrected control on tensor size for Q4 repacking.
b14b4713
Djip007 Djip007 force pushed from 8e5bd043 to b14b4713 282 days ago
Djip007
ggerganov
ggerganov commented on 2024-12-07
ggerganov
Djip007 Update ggml/src/ggml-cpu/ggml-cpu-aarch64.cpp
7dc8a3e2
Djip007 Update ggml/src/ggml-cpu/ggml-cpu-aarch64.cpp
e115f6f6
Djip007 add debug logs on repacks.
1221d13d
Djip007 Djip007 force pushed from be3c64b2 to 1221d13d 281 days ago
Djip007
ggerganov
Djip007
ggerganov ggerganov merged 19d8762a into master 281 days ago
Djip007
bartowski1182
slaren

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone