[CUDA] bfloat16 MatMulNBits #25161
tianleiwu
marked this pull request as draft 214 days ago
support bf16 in MatMulNBits
e6190236
tianleiwu
force pushed
from
b46441f7
to
e6190236
213 days ago
fix test
7a39e01b
tianleiwu
marked this pull request as ready for review 213 days ago
use intrinsic for bf16 to bf162
c95b016f
tianleiwu
merged
7fdd3863
into main 212 days ago
tianleiwu
deleted the tlwu/matmul_nbits_bf16 branch 212 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub