onnxruntime
[CUDA] fp16 intB gemm scale only kernel
#24955
Merged

[CUDA] fp16 intB gemm scale only kernel #24955

tianleiwu merged 6 commits into main from tlwu/fpA_intB_gemm_scale_only
tianleiwu
tianleiwu support scale only
ff351bb2
tianleiwu fix build
2b3676c4
tianleiwu format
8c658bff
tianleiwu refactoring
80d6fba9
tianleiwu format
12e9fd80
tianleiwu fix build for sm_52
6d24421f
tianleiwu tianleiwu requested a review from nenad1002 nenad1002 209 days ago
tianleiwu tianleiwu requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 209 days ago
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2025-06-04
tianleiwu tianleiwu requested a review from jiafatom jiafatom 209 days ago
tianleiwu tianleiwu requested a review from kunal-vaishnavi kunal-vaishnavi 209 days ago
kunal-vaishnavi
kunal-vaishnavi approved these changes on 2025-06-05
kunal-vaishnavi
kunal-vaishnavi commented on 2025-06-05
tianleiwu tianleiwu merged ab5ff6a9 into main 209 days ago
tianleiwu tianleiwu deleted the tlwu/fpA_intB_gemm_scale_only branch 209 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone