Fix dk/dv autograd error on TPU flash attention #8685
Fix dk/dv grads not extracting on flash attention if autograd activat…
c695bd7a
lsy323
approved these changes
on 2025-02-06
qihqi
approved these changes
on 2025-02-06
qihqi
merged
0cd1fc2a
into master 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub