DeepSpeed
85132adc - enable starcode((kv_head=1)) autotp (#4896)

Commit
1 year ago
enable starcode((kv_head=1)) autotp (#4896)

Hi, this PR aims to enable autotp for starcode (kv_head=1). Please kindly review. Thanks~

Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>