ochafik/llama.cpp

more layer specific names

ochafik committed 2 years ago

1df05959

give llama tensors layer-specific names

ochafik committed 2 years ago

bed6e155

copy graph inputs to metal

ochafik committed 2 years ago

75125243

move helpers around

ochafik committed 2 years ago

8012888a

Make KQ_scale an input too

ochafik committed 2 years ago

868c55f1

Copy tokens or embeddings inputs outside of graph builders

ochafik committed 2 years ago

c2558429

convert.py : use dir name to name the llama

ggerganov committed 2 years ago

Verified b532a69b

examples : fix underscore in beam-search + .gitignore (close #2900)

ggerganov committed 2 years ago

Verified c90d135e

gguf : add workflow for Pypi publishing (#2896)

monatis committed 2 years ago

Verified 0d1c7061

make : add test and update CI (#2897)

alonfaraj committed 2 years ago

Verified 95092944

docs : add `node-llama-cpp` to `README.md` (#2885)

giladgd committed 2 years ago

Verified 35092fb5

convert : various script cleanups/fixes + merges and special token handling (#2842)

KerfuffleV2 committed 2 years ago

Verified dc07dc49

llm.vim : stop generation at multiple linebreaks, bind to <F2> (#2879)

chaihahaha committed 2 years ago

Verified ad9ddcff

main : log file (#2748)

staviq committed 2 years ago

Verified 8341a259

tests : add a C compliance test (#2848)

cebtenzzre committed 2 years ago

Verified 84940895

ggml : add view_src and view_offs to ggml_tensor for views (#2874)

slaren committed 2 years ago

Verified 06abf8ee

remove outdated references to -eps and -gqa from README (#2881)

slaren committed 2 years ago

Verified c03a243a

Tell users attmepting to run perplexity with too few tokens to use more (#2882)

ikawrakow committed 2 years ago

Verified fa3582f5

10X faster BPE tokenizer (#2876)

ikawrakow committed 2 years ago

Verified e37e69dc

py : fix "usage" messages (#2873)

maddes8cht committed 2 years ago

Verified 53885d72

convert.py : fix baichuan7B support (#2870)

jameswu2014 committed 2 years ago

Verified bcce96ba

readme : add react-native binding (#2869)

jhen0409 committed 2 years ago

Verified 74e0caeb

make : fix clang tests build, add missing examples (#2859)

cebtenzzre committed 2 years ago

Verified d4b5e16c

metal : add option to disable debug logs (close #2764)

ggerganov committed 2 years ago

Verified 3a007648

scripts : add pipefail

ggerganov committed 2 years ago

611363ac

added `struct` to llama_dump_timing_info_yaml's `llama_context` (#2857)

MarcusDunn committed 2 years ago

Verified 95b6e521

train : mem usage and other improvements (#2439)

xaedes committed 2 years ago

Verified 44c117f4

llama-bench : set locale to utf8 (#2832)

slaren committed 2 years ago

Verified 43033b7b

YAML result logging + preset script (#2657)

JohannesGaessler committed 2 years ago

Verified 6b73ef12

make : fix tests build (#2855)

alonfaraj committed 2 years ago

Verified 75fafcbc
