llama.cpp
cuda: fix vmm oom issue on NVIDIA AGX Orin
#4687
Merged

Loading