tweaks to ds-attn, distilbert policy, and mup #2649
tweaks to ds-attn, distilbert policy, mup, etc.
d0d28bb5
jeffra
changed the title from "tweaks to ds-attn, distilbert policy, mup, etc." to "tweaks to ds-attn, distilbert policy, and mup" 2 years ago
fixes for distilbert
c49b10cd
adjust kwargs to match position args for bert (hack)
da81b59e
cmikeh2
approved these changes
on 2022-12-28
jeffra
merged
d9b788d7
into master 2 years ago
jeffra
deleted the jeffra/ds-attn-tweaks branch 2 years ago
Assignees
No one assigned