cuda : optimize argmax #10441
cuda : optimize argmax
35386e89
remove unused parameter
0a737d21
fixup : use full warps
1e9447a0
ggerganov
approved these changes
on 2024-11-21
Apply suggestions from code review
a734da71
fix ub
316f3d31
ggml : check ne00 <= INT32_MAX in argmax and argsort
48f94d41
slaren
merged
a5e47592
into master 1 year ago
slaren
deleted the sl/cuda-opt-argmax branch 1 year ago
Assignees
No one assigned
Labels
testing
Nvidia GPU
ggml
Login to write a write a comment.
Login via GitHub