llama.cpp
7cdd30bf - cuda : allocate all temporary ggml_tensor_extra_gpu from a fixed-size buffer (#2220)

Commit
2 years ago