onnxruntime
810512c6 - [ROCm] TunableOp: add hipBLASLt tuning logic (#16338)

Commit
2 years ago
[ROCm] TunableOp: add hipBLASLt tuning logic (#16338) ### Description - Add hipBLASLt tuning logic in place of default hipBLASLt implementation; - add kernel explorer for hipBLASLt. related operators: Gemm, StridedBatchedGemm, and GemmFastGelu. Temporarily mark algos that require extra workspace as unsupported. Will add workspace support in later PR, which will change Gemm Params def and affect multiple files.
Author
Parents
Loading