DeepSpeed
673cb608 - Improve z3 trace management (#1916)

Commit
3 years ago
Improve z3 trace management (#1916) * Fix OOM and type mismatch * Toggle prefetching * Disable z3 prefetching for inference (temp workaround) * Fix zero3 tracing issues * Remove debug prints * Enable prefetch for inference * Code clarity * Invalidate trace cache * Trace cache invalidation when needed Separate nvme prefetch from all-gather prefetch * Track last used step id * Use debug name in error message * Construct param trace from module trace Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Author
Parents
Loading