llvm-project
ff5f396d - AMDGPU: Handle rewriting non-tied MFMA to AGPR form (#153015)

Commit
32 days ago
AMDGPU: Handle rewriting non-tied MFMA to AGPR form (#153015) If src2 and dst aren't the same register, to fold a copy to AGPR into the instruction we also need to reassign src2 to an available AGPR. All the other uses of src2 also need to be compatible with the AGPR replacement in order to avoid inserting other copies somewhere else. Perform this transform, after verifying all other uses are compatible with AGPR, and have an available AGPR available at all points (which effectively means rewriting a full chain of mfmas and load/store at once).
Author
Parents
Loading