llama.cpp
ggml : use WARP_SIZE/2 for argmax reduction offset
#18092
Merged

ggml : use WARP_SIZE/2 for argmax reduction offset #18092

Aadeshveer
Aadeshveer ggml : use WARP_SIZE/2 for argmax reduction offset
c2f3f7a2
am17an
am17an approved these changes on 2025-12-16
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
JohannesGaessler
JohannesGaessler approved these changes on 2025-12-16
Aadeshveer
am17an am17an merged 58062860 into master 176 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone