llama.cpp
0f2e42ca - CUDA: only allocate FA tmp buffer if needed (#18564)

Commit
51 days ago
CUDA: only allocate FA tmp buffer if needed (#18564)
Parents
Loading