xla
Fix dk/dv autograd error on TPU flash attention
#8685
Merged

Fix dk/dv autograd error on TPU flash attention #8685

qihqi merged 1 commit into pytorch:master from flash_attention_autograd_dkv_fix
zmelumian972
zmelumian972 Fix dk/dv grads not extracting on flash attention if autograd activat…
c695bd7a
lsy323
lsy323 approved these changes on 2025-02-06
qihqi
qihqi approved these changes on 2025-02-06
qihqi qihqi merged 0cd1fc2a into master 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone