llvm-project
49995b2a - [MLIR][GPU] subgroup_mma fp64 extension (#165873)

Commit
49 days ago
This PR extends the `gpu.subgroup_mma_*` ops to support the fp64 type. The lowering to `nvvm` needs special handling because the load ops for fragments a and b return a scalar instead of a struct.
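The scalar-versus-struct distinction can be illustrated with a small plain-Python model. This is only a sketch of the idea, not MLIR or LLVM code: `fragment_elements` and the sample values are hypothetical names chosen for illustration, and a tuple stands in for a struct result.

```python
from typing import List, Sequence, Union

# Hypothetical model: a load result is either a bare scalar or a
# struct-like sequence holding one value per fragment element.
LoadResult = Union[float, Sequence[float]]


def fragment_elements(result: LoadResult) -> List[float]:
    """Normalize a load result into a flat list of fragment elements."""
    if isinstance(result, (tuple, list)):
        # Struct-like result: extract each field in order.
        return list(result)
    # Scalar result: there is nothing to extract, so wrap the value as-is.
    return [result]


# A struct-like result and a scalar result go through the same entry point.
struct_like = fragment_elements((1.0, 2.0, 3.0, 4.0))
scalar_like = fragment_elements(5.0)
```

Code that assumes every load yields a struct would try to extract fields from the scalar and fail, which is why the scalar case needs its own branch.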