llama.cpp
10b4f82d
- Added comments explaining thread block size selection logic based on row count and column size, derived from historical commit context (#18212)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
151 days ago
Added comments explaining thread block size selection logic based on row count and column size, derived from historical commit context (#18212)
References
#18212 - ggml : document occupancy heuristics in cuda_op_mean call to reduce_rows kernal
Author
Aadeshveer
Parents
408616ad
Loading