llama.cpp
bert : add tests + fix quantization
#5475
Merged

ggerganov merged 3 commits into master from gg/ci-add-bert
ggerganov llama : do not quantize pos embd and token type tensors
ce730ad7
ggerganov ci : add BERT tests
09b59430
ggerganov ci : do not do BERT tests on low-perf nodes
1ab4f152
ggerganov merged 49cc1f7d into master 1 year ago
ggerganov deleted the gg/ci-add-bert branch 1 year ago

Reviewers
No reviews
Assignees
No one assigned