llama.cpp
49cc1f7d - bert : add tests + fix quantization (#5475)

Commit
1 year ago
bert : add tests + fix quantization (#5475)

* llama : do not quantize pos embd and token type tensors

* ci : add BERT tests

ggml-ci

* ci : do not do BERT tests on low-perf nodes

ggml-ci