transformers
49aff18f - Actually i think the attention casting only makes sense when we use torch.float16

Commit
3 years ago
Actually i think the attention casting only makes sense when we use torch.float16
Author
Parents
Loading