llvm
c5a16460 - [CUDA] Fix Managed Memory cross-device copy to work without P2P

Commit

37 days ago

[CUDA] Fix Managed Memory cross-device copy to work without P2P

For NVIDIA A2 GPUs (GA107 chip) which lack P2P support:
- Prefetch both SRC and DST Managed Memory to CPU before copy
- CUDA driver automatically stages through host: GPU0→CPU→GPU1
- Detect cross-device copies using CU_POINTER_ATTRIBUTE_DEVICE_ORDINAL
- Fix unused variable warning

This enables multi-GPU tests to work on entry-level datacenter GPUs without NVLink/P2P, at reduced performance (host staging overhead).
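The copy-path decision described in the commit message can be modeled roughly as follows. This is a minimal sketch, not the code from the commit: `ManagedPtrInfo`, `planManagedCopy`, and the `peerAccessAvailable` flag are hypothetical names introduced for illustration, and whether the actual change checks P2P availability before prefetching is an assumption. In a real adapter the ordinals would presumably come from `cuPointerGetAttribute` with `CU_POINTER_ATTRIBUTE_DEVICE_ORDINAL`, and the prefetch steps would map to `cuMemPrefetchAsync` targeting `CU_DEVICE_CPU`.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for the result of querying
// CU_POINTER_ATTRIBUTE_DEVICE_ORDINAL on a managed allocation.
struct ManagedPtrInfo {
  int deviceOrdinal;
};

// Returns the ordered steps a runtime could take for a managed-memory copy.
// Assumption: host staging is only requested when the two allocations live on
// different devices and peer access between them is unavailable.
std::vector<std::string> planManagedCopy(const ManagedPtrInfo &src,
                                         const ManagedPtrInfo &dst,
                                         bool peerAccessAvailable) {
  assert(src.deviceOrdinal >= 0 && dst.deviceOrdinal >= 0);

  std::vector<std::string> steps;
  const bool crossDevice = src.deviceOrdinal != dst.deviceOrdinal;

  if (crossDevice && !peerAccessAvailable) {
    // Migrate both ranges to host memory so the driver never needs a
    // direct GPU-to-GPU transfer (the GPU0 -> CPU -> GPU1 path).
    steps.push_back("prefetch src to CPU");
    steps.push_back("prefetch dst to CPU");
  }

  steps.push_back("copy");
  return steps;
}
```

Under these assumptions, a same-device copy or a copy between peers with P2P enabled yields a single `copy` step, while a cross-device copy on hardware such as the A2 yields the two prefetches followed by the copy.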

References

usm-metadata-tracking

Author

kekaczma

Committer

kekaczma

Parents

21ff7a50
