Take an ASCII-oriented approach
Reduce the size of our instructions
Reduce the maximum program size
Optimize the sparse set to reduce field reads by interleaving the sparse and dense ranges
With these changes, `fast-glob` is still 4x as fast for a single match, but only 2x as fast when the pattern is 'precompiled', which is our use case.
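
For reference, a minimal sketch of what interleaving the sparse and dense ranges of a sparse set can look like. This is an illustration under assumptions, not the code from this change: the names `Slot` and `SparseSet`, the `u32` element type, and the fixed capacity are all hypothetical. The idea is that both ranges live in one allocation, so a membership test reads a single slice field plus `len` rather than two separate arrays.

```rust
/// One interleaved cell: `sparse` is indexed by value, `dense` by insertion order.
/// (Hypothetical layout, chosen for illustration.)
#[derive(Clone, Copy, Default)]
struct Slot {
    sparse: u32, // position in the dense range where this value was stored
    dense: u32,  // the value stored at this position of the dense range
}

/// Sparse set over values in `0..capacity`, backed by a single buffer.
struct SparseSet {
    slots: Vec<Slot>,
    len: u32,
}

impl SparseSet {
    fn new(capacity: usize) -> Self {
        SparseSet { slots: vec![Slot::default(); capacity], len: 0 }
    }

    /// Membership test: follow `sparse` to a dense position and check it points back.
    fn contains(&self, value: u32) -> bool {
        let i = self.slots[value as usize].sparse;
        i < self.len && self.slots[i as usize].dense == value
    }

    /// Inserts `value`; returns false if it was already present.
    /// Panics if `value` is not below the capacity.
    fn insert(&mut self, value: u32) -> bool {
        if self.contains(value) {
            return false;
        }
        let i = self.len;
        self.slots[i as usize].dense = value;
        self.slots[value as usize].sparse = i;
        self.len += 1;
        true
    }

    /// O(1) clear: stale slots are ignored because `contains` checks against `len`.
    fn clear(&mut self) {
        self.len = 0;
    }

    /// Iterates over the members in insertion order.
    fn iter(&self) -> impl Iterator<Item = u32> + '_ {
        self.slots[..self.len as usize].iter().map(|s| s.dense)
    }
}
```

In a matcher, a set like this would typically track which program states are active, with `clear` called between input positions; whether that matches the usage here is an assumption.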