DeepSpeed
d0b238a3 - Optimize preprocess for ragged batching (#4942)

Commit

2 years ago

Optimize preprocess for ragged batching (#4942) This PR improves efficiency of preprocessing for ragged batching. It is not efficient to iterate substituting values to tensor slices or copy/fill calls for small numbers of values. This PR records the values in python lists or primitives and copy them at once. --------- Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>

References

#4942 - Optimize preprocess for ragged batching

Author

tohtana

Parents

29417ab5

DeepSpeed d0b238a3 - Optimize preprocess for ragged batching (#4942)

DeepSpeed
d0b238a3 - Optimize preprocess for ragged batching (#4942)