llama.cpp
fe00a84b - tests: enable kv_unified to prevent cuda oom error on rtx 2060 (#20645)

Commit

67 days ago

tests: enable kv_unified to prevent cuda oom error on rtx 2060 (#20645)

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

Author

taronaeo

Parents