transformers
2cf87e2b - Prevent Dynamo graph fragmentation in GPTNeoX with torch.baddbmm fix (#24941)

Commit
2 years ago
Prevent Dynamo graph fragmentation in GPTNeoX with torch.baddbmm fix (#24941) * Pass a Python scalar for alpha in torch.baddbmm * fixup --------- Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
Author
Parents
Loading