llama.cpp
d1e35564 - CUDA: Replace init_offsets kernel with iterators in cub-based argsort (#18930)

Commit
55 days ago
CUDA: Replace init_offsets kernel with iterators in cub-based argsort (#18930) * CUDA: Replace `init_offsets` with iterators in argsort This is a QOL improvement, saving us the cost of materializing the iterator * Remove unnecessary include from top-k.cu
Author
Parents
Loading