onnxruntime
[CUDA] bfloat16 MatMulNBits
#25161
Merged

[CUDA] bfloat16 MatMulNBits #25161

tianleiwu merged 3 commits into main from tlwu/matmul_nbits_bf16
tianleiwu
tianleiwu tianleiwu marked this pull request as draft 214 days ago
tianleiwu support bf16 in MatMulNBits
e6190236
tianleiwu tianleiwu force pushed from b46441f7 to e6190236 213 days ago
github-actions
github-actions commented on 2025-06-26
tianleiwu fix test
7a39e01b
tianleiwu tianleiwu marked this pull request as ready for review 213 days ago
tianleiwu use intrinsic for bf16 to bf162
c95b016f
tianleiwu tianleiwu requested a review from nenad1002 nenad1002 213 days ago
tianleiwu tianleiwu requested a review from kunal-vaishnavi kunal-vaishnavi 213 days ago
tianleiwu tianleiwu requested a review from jiafatom jiafatom 213 days ago
nenad1002
nenad1002 commented on 2025-06-26
kunal-vaishnavi
kunal-vaishnavi commented on 2025-06-26
kunal-vaishnavi
kunal-vaishnavi commented on 2025-06-26
kunal-vaishnavi
kunal-vaishnavi commented on 2025-06-26
kunal-vaishnavi
kunal-vaishnavi commented on 2025-06-26
kunal-vaishnavi
kunal-vaishnavi approved these changes on 2025-06-26
tianleiwu tianleiwu merged 7fdd3863 into main 212 days ago
tianleiwu tianleiwu deleted the tlwu/matmul_nbits_bf16 branch 212 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone