llama.cpp
2c04beeb - cuda : avoid extra QxQ matrix in shared memory

Commit
2 years ago
cuda : avoid extra QxQ matrix in shared memory
Author
Parents
Loading