bert : add tests + fix quantization #5475
llama : do not quantize pos embd and token type tensors
ce730ad7
ci : add BERT tests
09b59430
ci : do not do BERT tests on low-perf nodes
1ab4f152
ggerganov merged 49cc1f7d into master 1 year ago
ggerganov deleted the gg/ci-add-bert branch 1 year ago