pytorch
54b7c7d5 - Added requested_bytes to CUDA Caching Allocator Stats (#88575)

Commit

1 year ago

Added requested_bytes to CUDA Caching Allocator Stats (#88575) Summary: The caching allocator can be configured to round memory allocations in order to reduce fragmentation. Sometimes however, the overhead from rounding can be higher than the fragmentation it helps reduce. We have added a new stat to CUDA caching allocator stats to help track if rounding is adding too much overhead and help tune the roundup_power2_divisions flag: - "requested_bytes.{current,peak,allocated,freed}": memory requested by client code, compare this with allocated_bytes to check if allocation rounding adds too much overhead Test Plan: Added test case in caffe2/test/test_cuda.py Differential Revision: D40810674 Pull Request resolved: https://github.com/pytorch/pytorch/pull/88575 Approved by: https://github.com/zdevito

Author

c-odrin

Committer

pytorchmergebot

Parents

dddc0b41

pytorch 54b7c7d5 - Added requested_bytes to CUDA Caching Allocator Stats (#88575)

pytorch
54b7c7d5 - Added requested_bytes to CUDA Caching Allocator Stats (#88575)