Fixes #12673. `record_stream` in group offloading is not working properly (#12721)
* Fixes #12673.
Wrong default_stream is used. leading to wrong execution order when record_steram is enabled.
* update
* Update test
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>