llama.cpp
c80b8a2b - llama : remove mirrors, perform Device -> Host when partial offload

Commit

2 years ago

llama : remove mirrors, perform Device -> Host when partial offload

References

#4309 - llama : per-layer KV cache

Author

ggerganov
