Chenta/avoid thread local (#13003)
**Description**: the thread local implementation of stream pool
introduces complicated issue during shut down. to avoid that,
re-implement the stream pool at session level, so we can have better
control of the stream pool's lifetime.
Co-authored-by: Cheng Tang <chenta@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>