Use pymongoarrow to get dataset results as dataframe (#2879)
* Adding initial test comparison for _get_df functions
* Fix style
* Try to fix CI
* Update worker dependencies
* Apply schema to avoid adding None data
* Fix orchestrator tests by fixing _id recognition
* Fix backfill tests by making status and priority categorical again