Fix a minor issue in the domain score calculation. (#908)
Summary:
There is a bug in the current domain subscore calculation. We should use `filtered_tests` instead of the entire test set to calculate domain subscores.
Pull Request resolved: https://github.com/pytorch/benchmark/pull/908
Reviewed By: erichan1
Differential Revision: D36323645
Pulled By: xuzhao9
fbshipit-source-id: 833f109dcd03d46e009b287942c45752ebdeae4e