sync : ggml #2444

ggerganov merged 18 commits into master from sync
ggerganov
ggerganov ggml : fix GGML_MAX_N_THREADS + improve formatting (ggml/969)
443678a5
smeso vulkan : argsort barriers must be under uniform control flow (ggml/951)
8fc92397
jeffbolznv vulkan : fix build for GGML_VULKAN_RUN_TESTS, add TFLOPS to log (ggml…
45c860f0
jeffbolznv vulkan : multithread pipeline creation (ggml/963)
ec8e9191
JohannesGaessler CUDA: remove bad assert (ggml/972)
44e2d393
bachelor-dou cann: fix crash when llama-bench is running on multiple cann devices …
d2eac9f1
chaxu01 ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels (llama/9217)
b0978f08
yeahdongcn mtgpu: enable VMM (llama/9597)
04485511
mtavenrath Enable use to the rebar feature to upload buffers to the device. (lla…
fd5cb2bf
eddnjjn ggml : add run-time detection of neon, i8mm and sve (llama/9331)
a98b5fa0
ggerganov ggml : define missing HWCAP flags (llama/9684)
9d176caa
JohannesGaessler ggml: fix gradient allocation logic (ggml/966)
034ed81a
iboB ggml : fix ggml_cast (ggml/973)
ee8e29c6
smeso vulkan : mul_mat: fix UB with small warps (ggml/952)
28a3391e
JohannesGaessler test: fix OPT_STEP_ADAMW for test-backend-ops (ggml/974)
2eda43aa
ggerganov sync : ggml
fce227e5
ggerganov metal : reduce command encoding overhead (llama/9698)
f0839085
ggerganov talk-llama : sync llama.cpp
b4c96313
ggerganov ggerganov merged ccc25472 into master 1 year ago
ggerganov ggerganov deleted the sync branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone