DeepSpeed
Fix cudaErrorInvalidConfiguration in attn_softmax() for large values of sequence_length*num_heads
#1239
Merged

Loading