llama.cpp 0f2e42ca - CUDA: only allocate FA tmp buffer if needed (#18564)
Commit
51 days ago
CUDA: only allocate FA tmp buffer if needed (#18564)
References
#18564 - CUDA: only allocate FA tmp buffer if needed
Author
JohannesGaessler
Parents
9dba9f53