llama.cpp
2c04beeb - cuda : avoid extra QxQ matrix in shared memory

Commit
1 year ago
cuda : avoid extra QxQ matrix in shared memory
Author
Parents
Loading