Added base support for the ATen Bool tensor type.
Added debug build support for TF and C++ tests.
Changed the s_copy_() tests, as the remote TF XLA_CPU device currently has issues.
Fixed TriangularSolve to clear the layouts when adding the shape's dimensions.
Fixed device data caching to be per-device.