BF16 and ne11 <= 16 (#15131)

Commit

64 days ago

CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (#15131) * CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16

References

Author

JohannesGaessler

Parents