llvm
8e5fa1a5 - [CUDA] Revert to not tracking USM Shared allocations

Commit

33 days ago

[CUDA] Revert to not tracking USM Shared allocations

After extensive testing with various approaches including:
- Detecting memory type (Managed vs Device) and using different APIs
- Using cuMemcpyPeerAsync for all cross-device copies
- Stream synchronization before peer copies

None of these approaches worked for Managed Memory cross-device copies.

Current hypothesis: CUDA Managed Memory between GPUs may not support explicit memcpy operations the same way as CPU<->GPU.

Reverting to let CUDA runtime handle Managed Memory migration automatically.

Author

kekaczma

Parents

bd224ebf
