llama.cpp
eb425962 - llama : do not use KV cache for non-causal models

Commit · 1 year ago
llama : do not use KV cache for non-causal models

ggml-ci
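The commit title rests on a distinction worth spelling out: a causal (decoder-style) model generates token by token, and each new token attends only to earlier positions, so previously computed keys and values can be cached and reused across steps. A non-causal (encoder-style, bidirectional) model, such as one used for embeddings, processes the whole sequence in a single pass, so there are no later steps for a KV cache to serve. The sketch below is illustrative pure Python, not llama.cpp code; all names are hypothetical:

```python
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def causal_attention_incremental(qs, ks, vs):
    """Decode step by step, growing a K/V cache.

    Token t attends only to cached keys/values 0..t, which is why
    caching pays off: earlier K/V rows are reused at every later step."""
    d = len(qs[0])
    k_cache, v_cache, outs = [], [], []
    for q, k, v in zip(qs, ks, vs):
        k_cache.append(k)
        v_cache.append(v)
        w = softmax([dot(q, kk) / math.sqrt(d) for kk in k_cache])
        outs.append([sum(wi * vv[j] for wi, vv in zip(w, v_cache))
                     for j in range(d)])
    return outs

def non_causal_attention(qs, ks, vs):
    """Encoder-style pass: every token attends to the full sequence at
    once, so nothing needs to be carried over between decode steps and a
    KV cache buys nothing."""
    d = len(qs[0])
    outs = []
    for q in qs:
        w = softmax([dot(q, k) / math.sqrt(d) for k in ks])
        outs.append([sum(wi * v[j] for wi, v in zip(w, vs)) for j in range(d)])
    return outs
```

At the final position the causal mask covers the entire sequence, so both variants agree there; at earlier positions they differ, because the bidirectional pass also sees future tokens. That asymmetry is the whole reason the cache exists only on the causal path.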