llama.cpp
fe00a84b - tests: enable kv_unified to prevent cuda oom error on rtx 2060 (#20645)

Commit
17 days ago
tests: enable kv_unified to prevent cuda oom error on rtx 2060 (#20645)

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>