Offsets textInfos: treat 32 bit unicode characters consuming two UTF-16 code units as one character instead of two (#8953)
* OffsetsTextInfo: When retrieving characters, support Unicode code points beyond 16 bits.
* Use constants for surrogates
* Review actions based on code snippet by @jcsteh
* Unit tests
* UTF-8 without BOM for tests
* Move range comments
* Update changes