llama.cpp
49cc1f7d - bert : add tests + fix quantization (#5475)

Commit

2 years ago

bert : add tests + fix quantization (#5475)

* llama : do not quantize pos embd and token type tensors

* ci : add BERT tests

ggml-ci

* ci : do not do BERT tests on low-perf nodes

ggml-ci

References

#5475 - bert : add tests + fix quantization

Author

ggerganov

Parents

99b8b43d
