onnxruntime
7fdd3863 - [CUDA] bfloat16 MatMulNBits (#25161)

Commit
212 days ago
[CUDA] bfloat16 MatMulNBits (#25161) ### Description Support bfloat16 for MatMulNBits in CUDA. ### Motivation and Context For LLM model with bfloat16 data type.
Author
Parents
Loading