[inductor] Cast loads from boolean tensors to `tl.int1` (#110388)
Triton currently loads pointer to `tl.int1` as `tl.int8`.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/110388
Approved by: https://github.com/lezcano, https://github.com/Skylion007