ggerganov/llama.cpp

Pull Requests Commits

ggml : remove K_QUANTS_PER_ITERATION (#8306)

ggerganov committed 311 days ago

Verified 117f7adb

py : rename requirements for convert_legacy_llama.py

compilade committed 317 days ago

91deef46

gguf-py : use snake_case in scripts entrypoint export

compilade committed 317 days ago

902de882

cont : fix link

ggerganov committed 317 days ago

Verified 3e3cc710

ggerganov committed 317 days ago

Verified c172b322

ggerganov committed 317 days ago

Verified d8f2da6b

py : switch to snake_case

ggerganov committed 317 days ago

Verified 39a41a53

llama : add OpenELM support (#7359)

icecream95 committed 317 days ago

Verified d7fd29ff

tokenize : add --show-count (token) option (#8299)

danbev committed 317 days ago

Verified 6f63d646

build: Export hf-to-gguf as snakecase

ditsuke committed 317 days ago

51d2ebad

doc: Add context for why we add an explicit pytorch source

ditsuke committed 317 days ago

1e920018

chore: Remove rebase artifacts

ditsuke committed 317 days ago

01a5f065

chore: Fixup requirements and build

ditsuke committed 317 days ago

07786a61

chore: ignore all __pychache__

ditsuke committed 317 days ago

de14e2ea

fix: Update script paths in CI scripts

ditsuke committed 317 days ago

82192291

fix: Actually include scripts in build

ditsuke committed 317 days ago

b1c3f26e

build(python): Package scripts with pip-0517 compliance

ditsuke committed 317 days ago

b0a46993

Inference support for T5 and FLAN-T5 model families (#5763)

fairydreaming committed 317 days ago

Verified 807b0c49

tests : add _CRT_SECURE_NO_WARNINGS for WIN32 (#8231)

danbev committed 317 days ago

Verified f8c4c073

llama : suppress unref var in Windows MSVC (#8150)

danbev committed 317 days ago

Verified 402d6fef

convert : fix gemma v1 tokenizer convert (#8248)

ggerganov committed 317 days ago

Verified 20fc3804

[SYCL] Remove unneeded semicolons (#8280)

AidanBeltonS committed 317 days ago

Verified f6190247

Define and optimize RDNA1 (#8085)

daniandtheweb committed 317 days ago

Verified d23287f1

ppl : fix n_seq_max for perplexity (#8277)

slaren committed 318 days ago

Verified 5f2d4e60

fix phi 3 conversion (#8262)

ngxson committed 318 days ago

Verified 916248af

fix typo (#8267)

foldl committed 318 days ago

Verified f8d6a238

Dequant improvements rebase (#8255)

AidanBeltonS committed 318 days ago

Verified fadde671

fix: add missing short command line argument -mli for multiline-input (#8261)

MistApproach committed 319 days ago

Verified a27152b6

Adding step to `clean` target to remove legacy binary names to reduce upgrade / migration confusion arising from #7809. (#8257)

HanClinto committed 319 days ago

Verified 3e2618bc

Removes multiple newlines at the end of files that is breaking the editorconfig step of CI. (#8258)

HanClinto committed 319 days ago

Verified 07a3fc06

Older