Prevent Dynamo graph fragmentation in GPTNeoX with torch.baddbmm fix #24941
Pass a Python scalar for alpha in torch.baddbmm
6339cf69
Merge branch 'main' into neox-norm-fix
38b3cbda
fixup
414cb5b7
sgugger
approved these changes
on 2023-08-23
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub