Improve the quality of soft actor-critic model (#645)
Summary:
- Fix the train batch size at 256
- Add comments that eval batch size can't be adjusted
- This model doesn't support prefetching either, because the training data is generated dynamically at runtime from the environment's responses
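
To illustrate why prefetching does not apply, here is a minimal sketch of the collect-then-sample pattern common in off-policy RL. It is not the benchmark's actual code: `ToyEnv`, `policy`, and `collect_and_sample` are hypothetical names made up for this example, and only the batch size of 256 comes from this change.

```python
import random
from collections import deque

TRAIN_BATCH_SIZE = 256  # fixed value from the summary above


class ToyEnv:
    """Hypothetical stand-in environment with a 1-D state."""

    def __init__(self, seed=0):
        self.rng = random.Random(seed)
        self.state = 0.0

    def step(self, action):
        # The next state depends on the chosen action, so a transition
        # cannot exist before the policy has acted on the current state.
        next_state = self.state + action + self.rng.uniform(-0.1, 0.1)
        reward = -abs(next_state)
        transition = (self.state, action, reward, next_state)
        self.state = next_state
        return transition


def policy(state):
    """Hypothetical fixed policy: push the state back toward zero."""
    return -0.5 * state


def collect_and_sample(env, buffer, steps, rng):
    """Interact with the environment, then sample one training batch."""
    for _ in range(steps):
        buffer.append(env.step(policy(env.state)))
    if len(buffer) < TRAIN_BATCH_SIZE:
        return None  # not enough experience collected yet
    return rng.sample(list(buffer), TRAIN_BATCH_SIZE)


env = ToyEnv()
replay_buffer = deque(maxlen=10_000)
batch = collect_and_sample(env, replay_buffer, steps=300, rng=random.Random(1))
```

Because each transition is produced only after the policy acts, there is no fixed dataset for a loader to read ahead from; the batch is drawn from whatever experience has been collected so far.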
Profile of train, bs=256:

Profile of eval, bs=1:

Pull Request resolved: https://github.com/pytorch/benchmark/pull/645
Reviewed By: aaronenyeshi
Differential Revision: D33117564
Pulled By: xuzhao9
fbshipit-source-id: a3d07e5da0164eb932a20c10281d268730f4f3df