llvm-project
a800ffac - [mlir][gpu] Disjoint patterns for lowering clustered subgroup reduce (#109158)

Commit
1 year ago
[mlir][gpu] Disjoint patterns for lowering clustered subgroup reduce (#109158)

Making the existing populateGpuLowerSubgroupReduceToShufflePatterns() function also cover the new "clustered" subgroup reductions is proving to be inconvenient, because certain backends may have more specific lowerings that only cover the non-clustered type, and this creates pass ordering constraints.

This commit removes coverage of clustered reductions from this function in favour of a new separate function, which makes controlling the lowering much more straightforward.
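The idea of disjoint pattern sets can be illustrated with a toy model. The sketch below is not MLIR code and does not use the MLIR API: `ToyReduceOp`, `ToyPattern`, `populateToyNonClusteredPatterns`, `populateToyClusteredPatterns`, and `firstMatch` are all hypothetical names invented for illustration. It only assumes that a reduction is "clustered" when it carries a cluster size, and shows how two populate functions with non-overlapping match conditions let a caller opt into each lowering independently.

```cpp
#include <functional>
#include <optional>
#include <string>
#include <vector>

// Toy stand-in for a subgroup reduce op (hypothetical, not an MLIR type).
// Only the presence or absence of a cluster size matters here.
struct ToyReduceOp {
  std::optional<unsigned> clusterSize;
};

// Toy stand-in for a rewrite pattern: a name plus a match predicate.
struct ToyPattern {
  std::string name;
  std::function<bool(const ToyReduceOp &)> matches;
};

using ToyPatternSet = std::vector<ToyPattern>;

// Hypothetical analogue of the existing populate function after the split:
// it registers a pattern that matches only non-clustered reductions.
void populateToyNonClusteredPatterns(ToyPatternSet &patterns) {
  patterns.push_back({"shuffle-non-clustered", [](const ToyReduceOp &op) {
                        return !op.clusterSize.has_value();
                      }});
}

// Hypothetical analogue of the new, separate populate function:
// it registers a pattern that matches only clustered reductions.
void populateToyClusteredPatterns(ToyPatternSet &patterns) {
  patterns.push_back({"shuffle-clustered", [](const ToyReduceOp &op) {
                        return op.clusterSize.has_value();
                      }});
}

// Returns the name of the first pattern that matches `op`,
// or an empty string if no pattern in the set matches.
std::string firstMatch(const ToyPatternSet &patterns, const ToyReduceOp &op) {
  for (const ToyPattern &pattern : patterns)
    if (pattern.matches(op))
      return pattern.name;
  return "";
}
```

In this model, a backend with its own lowering for non-clustered reductions could call only `populateToyClusteredPatterns`, and the generic non-clustered pattern would never compete with the backend-specific one. Because the two match conditions never overlap, the outcome does not depend on the order in which the pattern sets are applied.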