llama.cpp
10b4f82d
- Added comments explaining thread block size selection logic based on row count and column size, derived from historical commit context (#18212)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
9 days ago
Added comments explaining thread block size selection logic based on row count and column size, derived from historical commit context (#18212)
References
#18212 - ggml : document occupancy heuristics in cuda_op_mean call to reduce_rows kernal
Author
Aadeshveer
Parents
408616ad
Loading