llama.cpp
49cc1f7d
- bert : add tests + fix quantization (#5475)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
bert : add tests + fix quantization (#5475) * llama : do not quantize pos embd and token type tensors * ci : add BERT tests ggml-ci * ci : do not do BERT tests on low-perf nodes ggml-ci
References
#5475 - bert : add tests + fix quantization
Author
ggerganov
Parents
99b8b43d
Loading