llama.cpp
65cdf34b - llama : use n_embd_gqa instead of n_embd to handle llama-2 70B (#2433)

Commit
2 years ago
llama : use n_embd_gqa instead of n_embd to handle llama-2 70B (#2433)
Author
Parents
Loading