benchmark
3b538611 - Add new GPU arch supports for FMA count (#1568)

Commit

2 years ago

Add new GPU arch supports for FMA count (#1568) Summary: Function [ConvertSMVer2Cores](https://github.com/pytorch/benchmark/blob/main/components/model_analyzer/tb_dcgm_types/gpu_device.py#L114) will return the # of FMA units of asked arch. It needs to be synced with the upstream [`_ConvertSMVer2Cores`](https://github.com/NVIDIA/cuda-samples/blob/master/Common/helper_cuda.h) file. This PR adds the supports for SM 8.7(Ada) SM 9.0(H100). Pull Request resolved: https://github.com/pytorch/benchmark/pull/1568 Reviewed By: aaronenyeshi Differential Revision: D45230972 Pulled By: xuzhao9 fbshipit-source-id: 4ee3a1b3e205c7ed7c362ea4f54bc9bc87ab3d4d

Author

FindHao

Committer

facebook-github-bot

Parents

7c2d3eb0

benchmark 3b538611 - Add new GPU arch supports for FMA count (#1568)

benchmark
3b538611 - Add new GPU arch supports for FMA count (#1568)