onnxruntime
a8e776b7 - [java] Adds support for fp16 and bf16 tensors (#16703)

Commit

2 years ago

[java] Adds support for fp16 and bf16 tensors (#16703) ### Description The Java API currently only supports fp16 output tensors which it automatically casts to floats on the way out. This PR adds support for creating fp16 and bf16 tensors (from `java.nio.Buffer` objects or as the output of models, creation from Java short arrays is not supported), along with efficient methods for casting `FloatBuffer` into `ShortBuffer` filled with fp16 or bf16 values and vice versa. The fp16 conversions use a trick to pull in the efficient conversion methods added to Java 20, falling back to ports of the MLAS methods otherwise. The Java 20 methods can be special cased by the C2 JIT compiler to emit the single instruction on x86 and ARM which converts fp32<->fp16, or the vectorized versions thereof, so they should be quite a bit faster than the MLAS ported one. ### Motivation and Context fp16 and bf16 are increasingly popular formats and we've had several requests for this functionality. Fixes #7003. cc @yuslepukhin @cassiebreviu --------- Co-authored-by: Scott McKay <Scott.McKay@microsoft.com>

References

#16703 - [java] Adds support for fp16 and bf16 tensors

Author

Craigacp

Parents

1e18efad

onnxruntime a8e776b7 - [java] Adds support for fp16 and bf16 tensors (#16703)

onnxruntime
a8e776b7 - [java] Adds support for fp16 and bf16 tensors (#16703)