onnxruntime
7fdd3863
- [CUDA] bfloat16 MatMulNBits (#25161)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
212 days ago
[CUDA] bfloat16 MatMulNBits (#25161) ### Description Support bfloat16 for MatMulNBits in CUDA. ### Motivation and Context For LLM model with bfloat16 data type.
References
#25161 - [CUDA] bfloat16 MatMulNBits
Author
tianleiwu
Parents
47ddaaa0
Loading