onnxruntime
[CUDA] MatMulNBits benchmark
#24564
Merged

[CUDA] MatMulNBits benchmark #24564

snnn merged 4 commits into main from tlwu/benchmark_matmul_8bits
tianleiwu
tianleiwu Add benchmark script
9308f462
tianleiwu choose unroll kernel
705185e9
tianleiwu Replace unroll with simple loop
492b8dac
tianleiwu refine accumulation
48cb505e
tianleiwu tianleiwu requested a review from jiafatom jiafatom 1 year ago
tianleiwu tianleiwu requested a review from kunal-vaishnavi kunal-vaishnavi 1 year ago
snnn
azure-pipelines
kunal-vaishnavi
tianleiwu tianleiwu added release:1.22.0
kunal-vaishnavi
kunal-vaishnavi approved these changes on 2025-04-26
tianleiwu
snnn snnn merged 1dd9b992 into main 1 year ago
snnn snnn deleted the tlwu/benchmark_matmul_8bits branch 1 year ago
snnn snnn removed release:1.22.0
snnn

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone