llama.cpp
0ba20ed9 - llama : compute BERT graph with F16 K, V

Commit
1 year ago
llama : compute BERT graph with F16 K, V ggml-ci
Author
Committer
Parents
Loading