ochafik/llama.cpp

Pull Requests Commits

Apply suggestions from code review

ochafik committed 1 year ago

Verified 043cb99f

Merge remote-tracking branch 'origin/master' into r1-toolcall

Olivier Chafik committed 1 year ago

47002452

prefer json::at to operator[] in chat.cpp

Olivier Chafik committed 1 year ago

d52579a9

ggml-cpu : add chunking support to mul_mat_id (#11666)

slaren committed 1 year ago

Verified a394039d

ggml : x2 speed for WASM by optimizing SIMD (#11453)

ngxson committed 1 year ago

Verified be3bbd62

server : (webui) Give copy button back to all message bubbles (#11814)

woof-dog committed 1 year ago

Verified 31afcbee

HIP: Remove GCN from list of devices that avoid MMQ (#11831)

IMbackK committed 1 year ago

Verified 5c4284d5

Fix: Compile failure due to Microsoft STL breaking change (#11836)

MrSMlT committed 1 year ago

Verified bfd11a23

ggerganov committed 1 year ago

Verified 0fb77f82

HIP: Switch to std::vector in rocblas version check (#11820)

IMbackK committed 1 year ago

Verified e598697d

cleanup: fix compile warnings associated with gnu_printf (#11811)

mtmcp committed 1 year ago

Verified fef0cbea

ggml : fix multi-threaded clamp_f32 (#11824)

Burton2000 committed 1 year ago

Verified 748ee9fe

ggml-cpu: Fix duplicate MATMUL_INT8 (#11817)

ownia committed 1 year ago

Verified 198b1ec6

Merge remote-tracking branch 'origin/master' into r1-toolcall

Olivier Chafik committed 1 year ago

37a4bb25

CUDA: fix CUDART_VERSION checks (#11821)

JohannesGaessler committed 1 year ago

Verified c3d6af7c

llama : fix typo in llama-grammar.h [no ci] (#11816)

danbev committed 1 year ago

Verified 369be559

docs: add OpenCL (#11697)

lhez committed 1 year ago

Verified 4078c77f

Fix #11802: Compile bug - RegQueryValueExA changed to RegQueryValueEx (#11803)

sheldonrobinson committed 1 year ago

Verified 90e4dba4

server : use common_token_to_piece instead of common_detokenize (#11740)

danbev committed 1 year ago

Verified a18f481f

CUDA: use arch list for compatibility check (#11775)

JohannesGaessler committed 1 year ago

Verified b9ab0a4d

fix: typos in documentation files (#11791)

maximevtush committed 1 year ago

Verified 7b891bdc

docs: utilize the forward slash (/) as the path separator for Unix-like systems (#11770)

MambaWong committed 1 year ago

Verified 81732619

server : (webui) introduce conversation branching + idb storage (#11792)

ngxson committed 1 year ago

Verified 507f9174

llama-mmap: fix missing include (#11796)

wgottwalt committed 1 year ago

Verified 19b392d5

server : correct signal handler (#11795)

ngxson committed 1 year ago

Verified 0893e011

sync: minja (https://github.com/google/minja/commit/a72057e5190de2c612d4598bb10b4bfd0f53011f) (#11774)

ochafik committed 1 year ago

Verified d7b31a9d

Update README.md [no ci] (#11781)

pascal-lc committed 1 year ago

Verified 9ac3457b

vulkan: Make Vulkan optional at runtime (#11493). (#11494)

daym committed 1 year ago

Verified c2a67efe

vulkan: add environment variable GGML_VK_PREFER_HOST_MEMORY to avoid VRAM allocation (#11592)

wbruna committed 1 year ago

Verified b044a0fe

fix test-chat (update delta to latest r1 template change)

ochafik committed 1 year ago

01db4291

Older