[ROCDL] Added asynccnt and tensorcnt wait ops (gfx1250) (#163533)
This patch introduces some missing `s.wait.asynccnt` and `tensorcnt`
instructions in the ROCDL dialect.
The op is lowered to `@llvm.amdgcn.s.wait.asynccnt(i16 0)` and
`@llvm.amdgcn.s.wait.tensorcnt(i16 0)` intrinsic calls