AMDGPU: Fix mfma agpr allocation failures with -O0
Previously we were getting lucky on cases that can use AV registers
with the normal optimization pipeline.
I do not understand what the check against getAddressableNumArchVGPRs
was doing here. This logic needs to be consistent with getMaxNumVectorRegs,
as that is what getReservedRegs uses to determine the AGPR budget. In the future
we should directly check the minimum AGPR budget, and individual selection
patterns need to know the minimum budget required for them.